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Preface 


This book is derived from class-tested courses presented over many years. It de- 
livers the foundation for an advanced undergraduate course with focused examples 
from programs of study in computer engineering, materials engineering, chemical 
engineering, nuclear engineering, and engineering physics. A broad range of topics 
are covered, including Laplace and Fourier transform theory; analytical methods of 
solution for ordinary and partial differential equations; numerical solution of ordi- 
nary and partial differential equations based on finite difference and finite element 
techniques; matrices, linear systems, and vector analysis; treatment for experimental 
results; numerical techniques of finding zeros of functions, interpolation, integration, 
and differentiation; and complex analysis. It also presents advanced mathematical 
concepts missing in traditional texts, such as nondimensionalization, nonlinear dif- 
ferential equations, integral equations and the Green’s function, and calculus of vari- 
ations. This book uniquely provides both the underlying theory and its application 
using state-of-the-art computational tools available to the engineer. 


e It demonstrates and summarizes for the engineer new technology for applied 
mathematical analysis with algebraic, numerical, and statistical commercial soft- 
ware packages for day-to-day analysis. 

e Example problems in the chapters are revisited with a demonstration of commer- 
cial tools to quickly solve problems introduced throughout the book. 
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CHAPTER 


Prologue 


1.1 Introduction 


A differential equation is a mathematical construct involving derivatives (i.e., the 
rates of change of certain quantities) that includes an expression with one or more 
independent variables. It may represent a quantity embodied in a physical law of 
nature that may arise in such fields as mathematics, physics, astronomy, biology, 
economics, or engineering. 

Differential equations can be solved in certain cases as self-contained analytical 
expressions. For more complex and real-life problems that occur in various fields of 
study, a solution may require numerical methods on a computer. 


1.1.1 History of differential equations 


Differential equations were constructs that followed with the invention of differen- 
tial and integral calculus by Isaac Newton and Gottfried Wilhelm Leibniz c.1670. 
This advancement involved the use of infinitesimal quantities to determine tangents 
to curves and the calculation of lengths and areas of curved surfaces. The more mod- 
ern terms of derivatives and integrals, respectively, were described by Newton as 
“fluxions” and “‘fluents,”’ while Leibniz referred to these entities as “differences” and 
“sums.” Newton specifically described two ordinary differential equations and one 
partial differential equation that could be solved employing an infinite series method 
(Sasser, 2018): 


The “Bernoulli” ordinary differential equation was proposed by Jacob Bernoulli 
in 1695, which was further simplified and solved by Leibniz in 1696. The vibrating 
string problem was mathematically studied by Jean le Rond d’Alembert, Leonhard 
Euler, Daniel Bernoulli, and Joseph-Louis Lagrange. The one-dimensional wave 
equation studied in 1746 by d’ Alembert was generalized to three dimensions by Euler 
within a decade. The tautochrone problem (that is, an isochrone curve along which a 
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body will fall with uniform vertical velocity), as formulated by Euler and Lagrange in 
the 1750s, was subsequently solved in 1755 by Lagrange. Both Euler and Lagrange 
further developed this type of methodology for application to mechanics, leading to 
the important formulation of Lagrangian mechanics. Fourier also published his math- 
ematical work on the conductive flow of heat in 1822. 

A history of ordinary differential equations over 100 years is briefly summarized 
in Table 1.1. 


Table 1.1 History of ordinary differential equations. 
Date | Problem identity Problem description Mathematician 
1690 | Isochrone problem Finding a curve along James Bernoulli 
which a body will fall with 
uniform vertical velocity 
1691 | Quadrature of the hyperbola | Finding a square G. W. Leibniz 
equal to the area 
under the curve 
on a given interval 


1696 | Brachistochrone problem Finding the path John Bernoulli 
from which a particle 

will fall from one point to 
another in the shortest time 


1698 | Orthogonal trajectories Finding a curve which John Bernoulli 
cuts all curves of a family 
of curves at right angles 
1701 | Isoperimetric problem Making an integral a maximum | Daniel Bernoulli 
or minimum while keeping 
constant the integral 

of a second given function 
1728 | Reduction of second-order | Finding an integrating factor Leonard Euler 
to first-order equations 
1734 | Singular solutions Finding an equation of Alexis Clairaut 
a family of curves represented 
by a general solution 


1743 | Determining an integrating Concept of the adjoint Joseph Lagrange 
factor for a of a differential equation 
general linear equation 

1762 | Linear equation with Conditions under which Jean d’Alembert 
constant coefficients the order of a linear 


differential equation 
can be lowered 


se eee 
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1.1.2 Organization of the book 


The book consists of 14 chapters. It provides an important foundation in applied 
mathematics and demonstrates the use of modern technology for applied mathe- 
matical analysis with commercial software packages (e.g., algebraic, numerical, and 
statistical). The reader is first introduced to fundamental theory for the analytical and 
numerical solution of ordinary and partial differential equations. Example problems 
in the chapters are then revisited with a demonstration of the use of the COMSOL 
tool to quickly solve all differential equations previously introduced with the use 
of a finite element method. This work therefore demonstrates the power of modern 
and state-of-the-art computational technology available to the engineer. Hence, this 
treatment provides the underlying theory in order to understand how the tools can be 
applied. 


The chapters of the book are organized as follows: 


e Chapter | provides a historical perspective of the development of differential equa- 
tions and introduces the organization for the book. 

e Chapter 2 discusses ordinary differential equations, including first-, second-, and 
higher-order equations as well as systems of equations. Solution methods are dis- 
cussed for common engineering problems. This material is also needed for the 
solution of partial differential equations in Chapter 5. 

e Chapter 3 describes Laplace and Fourier transform methods that provide an alter- 
native analytical method to solve partial differential equations in Chapter 5. Also 
introduced are discrete Fourier transforms and fast Fourier transforms that find 
application in signal processing and for solution of boundary value problems. 

e Chapter 4 summarizes the use of matrices for solving linear systems of equa- 
tions. For instance, numerical solution of differential equations can be effectively 
formulated, reduced, and solved as a matrix problem. Vector calculus is further 
introduced for the derivation of important theorems for vector fields that arise in 
engineering. 

e Chapter 5 includes a treatise of analytical methods for solving partial differen- 
tial equations. This treatment considers the standard techniques of separation of 
variables and transform methods. 

e Chapter 6 examines difference numerical methods for differential equations as a 
numerical approach when the analytical methods of solution in Chapter 5 are not 
feasible. Such techniques are specifically needed for engineering problems that 
commonly arise in real-world situations. 

e Chapter 7 further examines finite element techniques as an alternative numeri- 
cal approach for the solution of ordinary and partial differential equations. This 
technique is used, for instance, in various engineering disciplines. This method of 
solution is used specifically in the COMSOL commercial software package and 
can be applied for multiphysics applications. 

e Chapter 8 describes the treatment of experimental results, which is an essential 
component for analyzing experimental data and understanding error propagation 
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in measurements and experimental design. Regression analysis is further detailed 
and demonstrated with the Excel commercial software package. 

e Chapter 9 details general numerical techniques needed to interpolate and smooth 
measured data, finding the zeros of functions, and perform numerical integration 
and differentiation. 

e Chapter 10 provides an introduction to complex analysis. For example, the residue 
integration method can be used to find inverse Laplace transforms that may not be 
found in standard mathematical handbooks for solution of ordinary and partial 
differential equation problems. The complex potential that arises in conformal 
mapping can be used to solve analogous problems in fluid flow, electrostatics, and 
heat transfer. 

e Chapter 11 discusses the concept of dimensional analysis for computing sets of 
dimensionless parameters to provide a functional relationship between dimension- 
less groups. It also describes the nondimensionalization of equations used in the 
scaling of laboratory models to full-scale systems or for simplification of differ- 
ential equations. 

e Chapter 12 extends the discussion to nonlinear differential equations which occur 
in nature. 

e Chapter 13 describes integral equations, where problems can be recast as an in- 
tegral rather than as a differential equation and vice versa. An integro-differential 
equation for neutron transport, for example, arises in nuclear reactor design. An 
integral equation provides a solution method where the boundary conditions of a 
differential equation can be incorporated into the kernel of an integral equation. 
This approach leads to the development of the Green’s function. 

e Chapter 14 describes the calculus of variations that complements ordinary differ- 
ential calculus. It provides a powerful technique to find an optimum quantity to 
be minimized (or maximized). It leads to the development of the Euler-Lagrange 
equation, with application to find the motion or optimal shape of an object, or 
determine the shortest path on a surface (geodesic). 


The relationships and interconnections of the subject matter in the various chap- 
ters of the book are highlighted in Fig. 1.1, where DE stands for differential equation, 
ODE stands for ordinary differential equation, and PDE stands for partial differential 
equation. 


1.1.3 Use of the book for course instruction 


For a course of 3 contact hours per week lasting two full-term semesters, the fol- 
lowing material was covered. For the first semester, as an introductory review, a 
few lectures were given on ordinary differential equations for the benefit of the stu- 
dents culminating in a summary of solution methods in Section 2.1.1. This material 
is needed as more advanced techniques are introduced and was included in the course 
instruction for completeness. Thus, during the first semester, the complete Chapter 2 
(Sections 2.1 to 2.5) and parts of Chapter 3 (Section 3.1) and Chapter 4 (Sections 4.1 
to 4.3) were presented. During the next semester, the remaining material in Chapter 3 
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Chapter 4: Matrix methods 


Chapter 5: PDE's(Analyticalsolution) 
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FIGURE 1.1 


Interconnection of chapter material. 


(Section 3.2) was covered along with Chapter 5 (Sections 5.1 and 5.2), Chapter 6 
(Sections 6.1 and 6.2), part of Chapter 7 (Section 7.1), Chapter 8 (Sections 8.1 to 
8.4), and Chapter 9 (Sections 9.1 to 9.5). Section 7.1 was essentially a one-period 
demonstration of COMSOL, where all problems in Chapter 6 were solved again. 

The remaining material, not taught at the undergraduate level, includes Chapters 
10 to 14, which cover more advanced material that can be used for specialized course 
instruction in graduate teaching. It can also be used for future reference for practicing 
engineers. 

All problems at the end of the chapters are solved in a separate solution manual 
that may be made available to the instructor. These problems consist of all assign- 
ments, quizzes, and examination questions given for these courses, plus additional 
problems introduced for advanced material in the book. 


CHAPTER 


Ordinary differential 
equations 


An ordinary differential equation is a differential equation that contains one or more 
functions of only one independent variable as well as derivatives of these functions. 
This can be contrasted to partial differential equations in Chapter 5 that contain more 
than one independent variable. The functions can represent physical laws to model 
dynamical systems and arise in many problems of biology, economics, physics, 
chemistry, and engineering. 

Techniques for solving first-order, second-order, and higher-order equations are 
presented. Solution methods for systems of ordinary differential equations are also 
discussed that result in the formulation of problems involving eigenvectors and eigen- 
values. Higher-order ordinary differential equations can also be recast as a system of 
first-order equations. Using power series methods for the solution of some types of 
second-order differential equations, including the Frobenius method, this methodol- 
ogy can give rise to important functions as solutions of these equations that occur in 
engineering. For example, a Bessel function will arise as a solution of a second-order 
differential equation when the geometry of a body is cylindrical in nature. It is fur- 
ther shown how special functions, such as Bessel and Legendre functions, are special 
cases of the more general Sturm—Liouville problem. Solutions of Sturm—Liouville 
problems lead to orthogonal sets of eigenfunctions, which can be used for series 
expansion of given functions. Such representations include Fourier series, Fourier— 
Bessel series and Fourier—Legendre series, among others. This type of analysis is 
important for engineering problems in heat conduction, fluid flow, and mass transfer 
for instance, where it is used to apply boundary conditions for the solution of partial 
differential equations. 


2.1 First-order equations 


Ordinary differential equations are equations involving derivatives and only one in- 
dependent variable. 
For example, 


dy 
y =tan(x) +2, where y = —, (2.1) 
dx 
” UA d?’y 
y +2y—4=0, where y =- (2.2) 
dx 
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The order of a differential equation is the order of the highest derivative that appears 
in the equation (for example, the orders of the examples in Eqs. (2.1) and (2.2) are 1 
and 2, respectively). 

Hence, first-order differential equations can be written as 


F(x,y, y) =0, (2.3) 


or sometimes [for example, see Eq. (2.1)], 


y= f(x,y). (2.4) 


Solution of first-order differential equations 
A solution is a function in the form 


y = h(x), (explicit solution), (2.5) 
H (x, y), (implicit solution), (2.6) 


which satisfies the differential equation in Eq. (2.3) on some open intervala < x < b. 

A general solution of a first-order differential equation involves an (essential) ar- 
bitrary constant. A particular solution of a first-order differential equation is obtained 
by assigning a specific value to the constant. 


Example 2.1.1. (i) The formula y = cos(x) + c is a general solution of y’ + sin(x) = 
0, and (ii) y = cos(x) + 5 is a particular solution. 


A singular solution is a solution which cannot be obtained from the general solu- 
tion by specifying the value of the constant. 
In most applications, one is presented with an initial value problem: 


y = f(x,y), Yo) = Yo, 2.7 


where x, and y, are given numbers in which the initial condition y(x) = Yo is used 
to determine a particular solution. 


Separable differential equations 
Many first-order differential equations can be reduced to the form 


gO)dy = f(x)dx, (2.8) 


which leads to the following solution by integration on both sides: 


[ soav= f fde. (2.9) 


re) = = = — 
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Example 2.1.2. Solve y’ + a yx? = 0 (where a is a constant). This ordinary differ- 
ential equation arises in the heat transfer problem of Example 5.2.13. By separation 
—x3a/3 


of variables, dy/y = —a x? dx. Using Eq. (2.9), one obtains y = ce . [answer] 


Modeling (separable differential equations) 


Example 2.1.3. Initial value problem: radioactive decay. In radioactive decay, the 
rate of loss of the number of atoms (—d N /dt) is proportional to N, where the pro- 
portionality constant is termed the radioactive decay constant A such that dN/dt = 
—AN. 

Determine (a) the law for radioactive decay subject to the initial condition N (t = 
0) = No and (b) the relationship between the radioactive decay constant and half-life 
of the material. 


Solution: 

(a) Using separation of variables, dN/N = —Adt. Analogous to Example 2.1.2, the 
solution is N(t) = ce™™ . Hence, the initial condition N(0) = ce? = N, implies c = 
No. The radioactive decay law is therefore N (t) = No e™™ , [answer] 

(b) The half-life 7; 2 is defined as the time after which the radioactive substance has 
decreased to half its original value No. To determine the radioactive decay constant, 


In(2 
Moe * Nn = 5 <, which implies A = a, [answer] 
1/2 


Example 2.1.4. Radiocarbon dating. In the atmosphere (and living organisms), the 
ratio of radioactive carbon n C and rae is constant. When an organism dies the ab- 
sorption of carbon ceases. Hence, the age of a fossil can be determined by comparing 
the carbon ratio in the fossil with that in the atmosphere. 


If a fossil contains 25% of the original amount of 14 C, what is its age? 


Solution: For radioactive decay from Example 2.1.3, N (t) = Noe™™ , where No is 
the initial amount of 14C and the half-life of (14C) is 5730 years. 


The radioactive decay constant 4 = In2/(5730) = 0.000121 y~!. Thus, the time after 
which 25% of '4C is still present aces by solving M,e~ 9000121" — 1y, 
n 


Hence, the age of the fossil is t = ————__ 
—0.000121 


= 11, 460 years. [answer] 


Example 2.1.5. Newton’s law of cooling. A container at a temperature of 45°C is 
placed into a cooler at time t = 0, where the cooler is at a temperature T, = 5°C. 
After 2 minutes, the temperature of the container T is reduced to 25°C. At what time 
is the temperature of the container equal to 15°C? 


OOOO. 
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Solution. 


dT 
(i) Modeling. Newton’s law of cooling is Tr = k (T — To), where Tọ is the temper- 


dT 
ature of the surrounding medium. Hence, the differential equation becomes 7 
k(T —5). 
(ii) General solution. Separation and integration of the above differential equation 


= kdt. Thus, In(T — 5) = kt +c’, which implies T(t) = ce! +5. 


dT 
ield 
yields ros 
(iii) Particular solution. Using the initial condition T(0) = ce? + 5 = 45 yields 
T(t) =40e" +5. 


25-5 
(iv) Determine k. Given T (2) = 40e*? +5 = 25, which implies k = 4 in( 40 ) = 


—0.3466 min—!. Therefore, for the answer, T(t) = 40e— 93466! 4.5 = 25, so that 
t = 4 minutes. [answer] 


Reduction to separate form 
Equations of the form 


y=s8(2) (2.10) 


can be made separable by a change of variables. Let 
y 
-=u or y=Xu. (2.11) 
x 


Product differentiation of Eq. (2.11) gives 


/ f / du 
y=u+xu, where u = —. (2.12) 
dx 
The right-hand side of Eq. (2.10) is 
y 
s (Žž) = g(u). (2.13) 
x 
: : du 
Therefore, equating Eq. (2.12) and Eq. (2.13) gives u + xu’ = g(u) or RTE = 
g(u)—u 
d 
= Integrating both sides yields 
x 
du 
nx = | —*— +e, (2.14) 
glu) —u 


y 
where u = ^. 
x 


re) = 
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Example 2.1.6. Solve xyy’ + y? +4x? = 0. Dividing this equation by x? gives Zy + 
x 


yy? , ; . u’ +4 
(*) +4=0. Setting u = y/x and solving for y’ yields y’ = -——— = g(u). 

x u 

1 d 
Therefore, Eq. (2.14) yields the result ln x = — z f s +c’. Thus, on integrating, 
u 
the final solution is y? = = — 2x”. [answer] 
x 
Exact differential equations 
A first-order differential equation of the form 
M(x, y)dx + N(x, y)dy =0 (2.15) 


is called exact if its left-hand side is the exact differential 


Ou ou 
du = —dx + —dy (2.16) 
ox oy 
of some function u(x, y). Then the differential equation in Eq. (2.15) can be written 
as du = 0, and by integration it follows that 


u(x, y)=c. (2.17) 


Comparing Eq. (2.15) and Eq. (2.16), Eq. (2.15) is exact if 
(a) — = and (b) — =N. (2.18) 
x y 


If M and N have continuous first partial derivatives, then 


ƏM ə? ƏN a? 
Se Gad ee, (2.19) 
oy dyox ox  ðxðy 
By continuity of the two second derivatives, 
dM ƏN 
— = —. (2.20) 
dy Ox 


In fact, Eq. (2.20) is not only necessary but also sufficient for Eq. (2.15) to be an exact 
differential. 
On integrating Eq. (2.18)(a) with respect to x, 


u = f max +k(y). (2.21) 


To determine k(y), we derive du/dy from Eq. (2.21), use Eq. (2.18)(b) to get dk/dy, 
and then integrate. 
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Example 2.1.7. Solve 
(3x7 y)dx + (x3 + 3y)dy =0. (2.22) 


Solution. 


aM 
(i) Test for exactness. We have M = 3x*y and N = x? + 3y?. Thus, = 3x? = 
y 


aN 
aa Therefore, Eq. (2.22) is exact. 
x 
(ii) Implicit solution. From Eq. (2.21) the implicit solution is u = f Mdx + k(y) = 
0 
J (3x?y) dx + k(y) = x3 y + k(y). To find k(y), use Eq. (2.18)(b) such that = = 
y 


3 dk 3 2 . _ ak PEA : 34 
x tpo Ama +3y - Thus, one obtains ~- = 3y with the solution k = y? + c’. 
y 


The final solution is 
AS (a y+ y’) =c. [answer] (2.23) 


(iii) Check implicit solution u(x, y) = c. Differentiating Eq. (2.23) with respect 
to x gives (3x*y + x3y’ + 3y7y’) = 0. This latter expression simplifies to 3x7 y + 
(x? + 3y) y’ =0, which, in turn, yields Eq. (2.22) since y' = dy/dx. 


Integrating factors 
Sometimes an equation 
P(x, y)dx + Q(x, y)dy =0 (2.24) 


is not exact, but it can be made exact by multiplying by an integrating factor F(x, y) 
such that F Pdx + F Qdy = 0, where (compare with Eq. (2.20)) 


(FP) _ d(FQ) 


: 2.25 
dy Ox ( ) 
Moreover, if F only depends on one variable, that is, F = F(x), Eq. (2.25) yields 
oP OF ð 
F-— =Q +F Q 
dy ox ax 


Dividing through by F Q and rearranging gives 


10F 1/ƏP aQ 
Fax Q\dy ax)’ 


(2.26) 


Thus, if the right-hand side of Eq. (2.26) only depends on x, that is, R(x) = 
1/0P ð 
= (5 — z2) , then the integrating factor can be obtained from 
y x 


F(x) =exp f R(x)dx. (2.27) 


ee) = = — 
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Alternatively, if F = F (y), then instead of Eq. (2.26), one has 


1aF 1 (22 PP) 
= (2.28) 
F əy P\ ox dy 
and 
Fy) =exp | Rody. (2.29) 


Example 2.1.8. Solve the radioactive decay equation of Example 2.1.3 with a con- 
stant source term Re: 


UN ok XN (2.30) 
dt ~ ; í 


with the initial condition N (t = 0) = No. 
Solution. Eq. (2.30) can be equivalently written as 
(Re— à N)dt— dN =0. (2.31) 


Thus, one can identify P = (Re — à N) and Q = —1. Hence, 


= 1/əP Q 7 
ro- (5 2)- I[-A]=À 


ro=exp] faar) =e 


Therefore, multiplication of Eq. (2.30) by this integrating factor gives 


and 


dN dNe™ 
—e = Re“ —) Ne™ or Ned) 


= Ree™. 
dt dt 


Thus, separation of variables with N (0) = No yields the solution: 
R 
N(t)= a (l-e™“)+Noe™. [answer] 


This problem can represent the rate of release of radioactive fission products into a 
reactor coolant, as shown in Example 9.4.1. 


Example 2.1.9. Consider the mixing of salt into a tank containing a volume of water 
V (t) (m?) with a mass of salt m(t) (kg). Instantaneous mixing in the tank is assumed 
with a volumetric flow rate of water into the tank ù; (t) (m?/s) with a salt concentration 
c(t) (kg/m?) and a volumetric flow rate of water out of the tank ùo (t) (m3/s) with a 


OOOO. 
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salt concentration co(t) (kg/m? ). The conservation equation for the volume of water 
in the tank is 
dV(t) 
dt 
Assuming that the flow rates in and out of the tank are equal and constant, ù; (t) = 
Vo(t) = ù, the solution of this differential equation for the volume of water is simply 
V(t) = Vo. Similarly, the conservation equation for the mass of salt in the tank is 


= ù; (t) — V(t). 


dm _ og 
Fn = tlel) — col). 


Moreover, assuming a constant inlet concentration of salt c;(t) = c; with an outlet 
m(t) 


concentration of co (t) = , the mass balance for the salt becomes 


oO 


dm(t) | 


. ù 
J ùci vn (2.32) 


Solve for the mass of salt in the tank m(t) and the outlet concentration of salt 
Co(t). 


Solution. From Example 2.1.8, noting that Eq. (2.30) and Eq. (2.32) are of the same 
form where on comparison Re —> ùc; and 4 > v/ Vo, the solution follows on apply- 
ing an integrating factor with the initial condition m(t = 0) = mo: 


m(t) = ci Vo (1 ag ®/ a) + my eW &/ Vo) [answer] 


and 


ime (1 — e70 w] + (mo/Vo)e7®/ Y", [answer] 


Linear differential equations 
A first-order linear differential equation is of the form 


y + p(x)y =r(x). (2.33) 


If r(x) = 0, then the equation is called homogeneous. In this case, by separating vari- 


d 
ables, the following solution follows: at = — p(x)dx. Thus, ln y = — f p(x)dx +c! 
y 


or 


yx) = ce7 f Pedr, (2.34) 


The nonhomogeneous Eq. (2.33) can be solved with an integrating factor where 
Eq. (2.33) is written as (p(x)y — r)dx + dy = 0, so that P = py —r and Q = 1. 


ee = = — 
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1 dF 
Therefore Eq. (2.26) becomes Fa” p(x) and Eq. (2.27) yields the integrating 
x 


factor 
F(x) = el Pe, (2.35) 


1 
Thus, multiplying Eq. (2.33) by Eq. (2.35) gives ef Pax (yl 4 py)= (e! pax y) = 


el pdx}. On integrating with respect to x, ef Py = f ef Pax rdx + c and solving for 
y, 


ya)=e" || e"rdx + e , h= 1 p(x)dx. (2.36) 


Example 2.1.10. Solve y’ + y = sinh x. Here, p = 1,r = sinh x, and h = f pdx =x. 


1 Xx 
From Eq. (2.36), y(x) =e * i e* sinhx dx + c] z3 E -ze> | +ce™. [answer] 


Reduction to linear form 


The Bernoulli equation is given by 


y + p(x)y = g(x)y* | (aisa real number). (2.37) 


This equation can be reduced to a linear form on letting 

u(x) = [y]. (2.38) 
Hence, differentiating Eq. (2.38) and substituting y’ from Eq. (2.37) gives 
u' = (1—a)y “y’ = (1—a)y“(gy* — py) = (1—a)(g— py!) = (1 —a)(g — pu). 


Hence, one obtains the linear equation u’ + (1 — a)pu = (1 — a)g of the form of 
Eq. (2.33). 


2.1.1 Summary of solution methods 
Any first-order differential equation can be put in the form 


d 
Z =f(x,y) or M@&,y)dx+N(,y)dy=0, (2.39) 
Xx 


where the general solution contains an arbitrary constant. The following Table 2.1 
summarizes various techniques for finding the general solution of Eq. (2.39). 
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Table 2.1 General solution of various first-order differential equations. 


Differential equation 
1. Separation of variables 
Si@)gi(ydx + fo(x)ga(y)dy 


2. Exact equation 
M(x, y)dx + N(x, y)dy =0, aM _ an 


dy ax 


3. Integrating factor 
M(x, y)dx + N(x, y)dy =0, SM 4 


4. Linear equation 
et P@)y=r(x) 


5. Reduction to separable form 


=e?) 


6. Bernoulli's equation 
Z + p(x)y =aa)y", a £0, 1 


General solution (or method to obtain it) 


Divide by gı (y) f2(x) 40 and integrate to obtain 


fix) g2(x) 
J Faye + S Fe 


dy=c. 


The equation can be written as 

Mdx + Ndy =du(x, y)=0, 

where du is an exact differential. Thus, the 
solution is u(x, y) =c or equivalently 


{Max +f[v- Bf Max]dy=c, 
where 0x indicates that the integration is to be 


performed with respect to x keeping y constant. 


The equation can be written as 

FMdx + FNdy =0, 

where F is an integrating factor so that 

2 (FM) = È (FN) and then method 2 applies. 


dy 


An integrating factor is given by F = e/ Pœ)dx 
and the equation can be written 

4 (Fy) = Fr with solution Fy = f Frdx +c or 
y =e" [|f e"rdx +c], where h = f pdx. 


Let y/x = u, and the equation becomes 

u +x% = g (u) Or xdu — (g (u) — u)dx = 0, 

which is of the form of type 1 and has the solution 
Inx =f aE +c, 

where u = y/x. If g(u) = u the solution is y = cx. 


Letting u = y'~¢ the equation reduces to type 4 
with solution 

uel- J pdx — (1 — a) f gets pax dx +c. 

If a = 0, the equation is of type 4. 

If a = 1, it is of type 1. 


eae = = 
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Table 2.1 (continued) 


Differential equation General solution (or method to obtain it) 
7. Equation solvable for y 


y = g(x, p), where p= y’ Differentiate both sides of the equation with respect to x, 


dy _ dg _ dg , dg dp 
dx dx ax ' Op dx 


or p= 8 + Meee, 

[hen solve this last equation to obtain G(x, p, c) =0. 

The required solution is obtained by eliminating p between 
G(x, p,c) and y = g(x, p). An analogous method exists if 
the equation is solvable for x. 


8. Clairaut’s equation 
y= px + g(p), where p = y' | The equation is of type 7 with solution 

y=cx+ g(c). 

The solution will also have a singular solution in general. 


2.1.2 Approximate solution by iteration 


Picard’s iteration method provides an approximate solution to the initial value prob- 
lem in Eq. (2.7). By integration, Eq. (2.7) may be written in the form 


x 
y = yo + f f(t, y(t))dt. (2.40) 
Xo 
Successive approximations to the solution y(x) in Eq. (2.40) are obtained as 
Y= Yo; 
x 
MA=Yot | f(t, Yo))dt, 


Xo 


eae f F(t, dt, 


nt) = yo | F(t, Yn-16))dt. (2.41) 


Example 2.1.11. Solve y'= 1 + y, y(0) = 0. 


Solution. Here xo = 0, yo = 0, f(x, y) = 1 + y, and Eq. (2.41) becomes y(x) = 


x 


x 
f [1 + yn-1(t)]dt = x + yn—1(t)dt. Solving from yo = 0 gives 
0 0 


an 
18 CHAPTER 2 Ordinary differential equations 


x 
nayea+ f Odt =x, 
0 


x x2 
nay=a+ f tdt =x + —, 
0 2 
x 12 x2 x3 x4 
= t dt = : 2.42 
y3(x) s+ (+5 5) x+ tera [answer] ( ) 


Alternatively, the solution can be obtained from separation of variables (see Sec- 
d i 
tion 2.1): Ee = dx. On integrating, In(1 + y) = x +c’ so that y = ce” — 1. But 
y 


2 x3 4 


the condition y(0) = 0 yields y = e* — 1 = x + a + pi + a + ... for the range 


(—0o < x < œ). This latter expression agrees oie A a 


2.1.3 Existence and uniqueness of solutions 

Given the initial value problem in Eq. (2.7), it is of practical importance that a given 
model has a unique solution before one tries to compute a solution. In particular, it is 
important to determine the conditions that: 


(i) the initial value problem has at least one solution (existence), 
(ii) the initial value problem has at most one solution (uniqueness). 


Existence theorem 

If f(x, y) is continuous at all points (x, y) in R, |x — xo| < a, |y — yo| < b, and 
bounded in R, such that | f(x, y)| < K for all (x, y) in R, then the initial value prob- 
lem has at least one solution y(x). This solution is defined at least for all x in the 


interval |x — xo| < œ, where « is the smaller of two numbers a and b/K. 


Uniqueness theorem 
If f(x, y) and a are continuous for all (x, y) in R and bounded such that (a) | f| < K 


and (b) Y < < M for all (x, y) in R, then Eq. (2.7) has at most one solution y(x). 
Hence, by the existence theorem it has precisely one solution which is defined at least 


in the interval |x — xo| < œ. In fact, it can be obtained by Picard’s method, where 
Eq. (2.41) (with n = 1, 2, ...) converges to that solution y(x). 


Consider Example 2.1.11 and take R: |x| < 5 and |y| < 3 (so that a = 5 and b = 3). 
Thus, 


a 
IfJ=l+yl < K =4, #=1sM=1, a =b/K =0.75 <a. 
y 


ee) = = = — 


2.2 Second-order linear differential equations 19 


2.2 Second-order linear differential equations 
2.2.1 Homogeneous linear equations 


A second-order differential equation is linear if it has the form 


y” + p(@x)y’+q@)y =r(x). (2.43) 


If r(x) = 0, the equation is homogeneous: 


y” + p(x)y' + q(x)y =0. (2.44) 


For example, (1 — x?) y” + 2x? y’ + 5y =0. 


Eq. (2.44) has the important property that a linear combination of solutions is 
again a solution (superposition principle or linearity principle). Two linearly inde- 
pendent solutions yı and y2 of Eq. (2.44) form a basis of solutions, where the 
general solution is given by 


y=c1y1 + c2y2 (2.45) 


and cı and c2 are arbitrary constants. A particular solution from Eq. (2.45) is obtained 
if one specifies numerical values for cı and cz by imposing two initial conditions: 


y(xo) = Ko, y'(%o) = Ki1| (Xo, Ko, and K are given numbers). (2.46) 


Together Eq. (2.44) and Eq. (2.46) constitute an initial value problem. If p and q 
are continuous on some open interval I and x, is in I, then Eq. (2.44) has a general 
solution on I, and Eq. (2.44) and Eq. (2.46) has a unique solution on I (which is 
a particular solution; thus, Eq. (2.44) has no singular solutions). Some applications 
may also lead to the boundary conditions 


y(P1) =k, yCP2) =ko,| (Pi, Po, ki, and kz are given numbers), (2.47) 


where Pı and Pz are endpoints on I. Together Eq. (2.44) and Eq. (2.47) constitute a 
boundary value problem. 


Linear independence 
Two functions y; (x) and y2(x) are linearly independent on an interval I if 


kıyı (x) + ko yo(x) = 0 on I implies kı = 0 and kz = 0, (2.48) 


that is, yı and y2 are not proportional to one another. 


ae 
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Table 2.2 General solution of second-order homogeneous differential equations 
with constant coefficients. 


Case | Roots (Eq. (2.50)) Basis (Eq. (2.49)) | General solution 

I Distinct real 41, A2 eh IX | eh2x y=cye*!* + cze 

I Real double root A = -3 e 2 xe T y=(a+ ox)e T 

lll Complex conjugate 
1 =-—5 +iow eT 2 cosw y=e 2 (Acoswx + B sinwx) 
2 =—-§ -iw eT? sinw 


2.2.2 Homogeneous equations with constant coefficients 


If p(x) and q(x) are constant (that is, p(x) = a and q(x) = b), one obtains the ho- 
mogeneous linear equation 


y” +ay' + by =0. (2.49) 


This equation can be solved by substituting y = e**, yielding 
(a? +aà + b) e = 0, 


Thus, A is a root of the characteristic equation (or auxiliary equation) 


MW +a +b=0. (2.50) 


The solution of Eq. (2.50) is 


ie La + Va? — 4b), ha = $a ~ Va? — 40). (2.51) 


Depending on the discriminant a? — 4b, one obtains three cases: 


Case I: two real roots Aj, Az if a? — 4b > 0; 
Case II: a real double root A, = A2 = —a/2 if a? —4b=0; 
Case III: complex conjugate roots 4; = —5 + im and Az = —5 — iw, where 


2 
(o= yb- z), if a? — 4b <0, 
The general solution of Eq. (2.49) for the three cases is given in Table 2.2. 
Example 2.2.1. (Case I) Solve y” — 4y =0. 


Solution. Here a = 0 and b = —4, which implies 4; = 2 and A2 = —2 from Eq. (2.51). 
Therefore, the general solution is y = c 1e?” + cze”. [answer] 


ee) = — 


2.2 Second-order linear differential equations 21 


Example 2.2.2. (Case II) Solve y” + 2y’+y=0. 


Solution. Here a = 2 and b = 1, which implies a double root à = —1 (since a? —4b = 
0) from Eq. (2.51). Therefore, the general solution is y = (c1 + c2x)e™™*. [answer] 


Example 2.2.3. (Case III) Solve y” + 2y’+5y =0. 


Solution. Here a = 2 and b = 5, which implies complex conjugate roots (since a? — 


2 2 
4b < 0) from Eq. (2.51). Therefore, 4; = —1 + i,/5 — © 


—1 — 2i. The general solution is y = e~*(Acos 2x + B sin 2x). [answer] 


= —] + 2i and à2 = 


Example 2.2.4. (Kirchhoff’s second law) Consider an electric current I(t) flow- 
ing through a circuit containing a resistance R, capacitance C, and inductance L 
(Fig. 2.1). These quantities are related through Kirchhoff’s voltage law as given by 
the integro-differential equation (see Chapter 13) 

1 t 
LIY +RI + =/ I(s)ds=0. 


to 


Differentiating this equation with respect to time ¢ yields a second-order homoge- 


I(t) 
FIGURE 2.1 


Diagram of an LRC circuit. 


neous ordinary differential equation with constant coefficients: 
10)’ +(2) 10@+()1@=0 
L LC 


I” +2wgl' +w?1=0. 


or 


Here wg = R/(2L) and w2 = 1/(LC). Thus, the characteristic equation is a? + 


2wa + w2 = 0 with roots à1,2 = —wa + ,/ w3 — w2. If R = 0, then wy = 0 and 
à = +i wọ. Hence, from Table 2.2, the solution is 


I(t) = Acos(wot) + B sin(wot). 


ae 
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This result implies that with no resistance, the circuit oscillates with no dissipation. 
On the other hand, if R < ./4L/C (such that w3 < w2), A= —wa £i,/w2 — w3 = 
—wq + if, where f =,/ we — w2. For this case, from Table 2.2, 


I(t) = Ae~“4' cos(bt) + Be” sin(bt). 


Consequently, the presence of a resistor damps the current oscillations produced by 
the capacitor and inductor. 


Example 2.2.5. (Damped oscillator/spring) Consider a mass m at the end of a spring 
(Fig. 2.2). From Newton’s second law of motion, the force on a body is equal to the 
mass times the acceleration of the body (that is, my”(t)). This force is equal to the 
sum of all forces acting on the body. From Hooke’s law for the force exerted by the 
spring for a displacement y, Fs = —k- y, where the spring has a constant k > 0. There 
is a friction force by the air that is proportional to the velocity of the object such that 
F, = —d - y'(t), where d > 0 is a damping coefficient. Hence, for this given force 
balance, 


my" =—dy'—kyormy"+dy'+ky=0. 


FIGURE 2.2 


Diagram of a spring system. 


Similar to Example 2.2.4, with wg = d/(2m) and w2 = k/m, the characteristic 


equation à + 2wa + w2 = 0 has roots à1 2 = —wg + if, where 6 = ,/ w2 — w2. 
Hence, from Table 2.2, the solution is 


y(t) =e “4' (Acos(Bt) + Bsin(Bt)). 
Equivalently, in terms of an amplitude A and phase ¢ for the damped oscillator, 


y(t) = Ae”! cos(Bt — ). 
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2.2.3 Euler-Cauchy equation 
The Euler—Cauchy equations of the form 


xy" +axy’ +by=0]| (a,b constants) (2.52) 


can also be solved by an algebraic method by substituting y = x” into Eq. (2.52), 
yielding 


x2m(m—1)x™-2 +axmx™-!+ bx" =0. 


Omitting x” (where x 4 0) yields the auxiliary equation 


m? +(a—1)m+b=0. (2.53) 


Similarly, three cases arise for the general solution of Eq. (2.52), as shown in Ta- 
ble 2.3. 


Table 2.3 General solution of the Euler-Cauchy equation. 


Case Roots (Eq. (2.53)) Basis (Eq. (2.52))|General solution 
I Distinct real mı, m2 Ml, x™2 y = c1 x”! + coax”? 
ll Real double root m = 1a —a) xa y= (cy +c2Inx)x 632 


lll Complex conjugate 


my =u+iv x" cos(v Inx) y =x"[Acos(v Inx) + Bsin(v Inx)] 


m =u iv x" sin(v Inx) 


Example 2.2.6. (Case I) Solve x7y" — 3xy' +3y =0. 


Solution. The auxiliary equation is m? — 4m + 3 = 0. Hence, mı = 3 and m = 1 
from Eq. (2.53). Therefore, the general solution is y = cix? + cox. [answer] 


Example 2.2.7. (Case II) Solve x?y” — xy’ + y =0. 


Solution. The auxiliary equation has a double root m = 1. Therefore, the general 
solution is y = (c1 + c2 ln x)x. [answer] 


Example 2.2.8. (Case III) Solve x*y"” — xy’ + 2y =0. 


Solution. The auxiliary equation is m? — 2m + 2 = 0, which implies m1,2 = 1+ 
—?2)2 
iyf2— cA = 


1 +i. The general solution is y = x[(A cos(lnx) + B sin(ln x)]. 


[answer] 


EEE 
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2.2.4 Existence and uniqueness of homogeneous equations 


Consider the general homogeneous linear equation 


y” + p(x)y’ +q(x)y =0, (2.54) 


subject to the two initial conditions 


y(xo) = Ko, y' Go) = Kı. (2.55) 


Existence and uniqueness of the initial value problem 


If p(x) and q(x) are continuous functions on some open interval I and x, is in I, then 
the initial value problem Eq. (2.54) and Eq. (2.55) has a unique solution on I. 


Linear independence of solutions 
Define the Wronskian of two solutions yı and y2 as 


y2 


yı 
Wy, y=, <7 =y y2 (2.56) 
Yy Yo 


If Eq. (2.54) has continuous coefficients p(x) and g(x) on some open interval I, then 
the two solutions yı and yp are linearly dependent on I if and only if W = 0 at some 
Xo on I. Furthermore, if W = 0 for x = xo, then W = 0 on I; hence, if there is an x1 
in I at which W £0, then yı and y are linearly independent on I. 


Example 2.2.9. Show that y = (cı +c2x)e~ is a general solution of y” +2y' + y = 
0 for Example 2.2.2. 


Solution. Substituting in the solutions y; = e~* and y2 = xe™™ into the Wronskian 
in Eq. (2.56), the above theorem implies linear independence since 


e xe * 
-e* (1-x)e* 


W(e*,xe *)= = (1 — x)e 7” +2¢ “=e £0. 


[answer] 


General solution of homogeneous linear differential equations 


Existence: If the coefficients p(x) and q(x) are continuous on I, then Eq. (2.54) has 
a general solution on I. 


General solution: If Eq. (2.54) has continuous coefficients p(x) and g(x) on I, then 
every solution on I is of the form 


y(x) = cry1 (x) + c2y2(*), (2.57) 


where y; (x) and y2(x) form a basis of solutions of Eq. (2.54) on I and cı and c3 are 
suitable constants. 
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Method to obtain a second basis solution 


The method of reduction of order provides a means to obtain a second basis solution 
of the homogeneous Eq. (2.54). Thus, if yı is a solution of Eq. (2.54), let y2 = uyy. 
Substituting y2 and the corresponding derivatives y, = u’y; + uy, and yf =u" yı + 
2u'y + uy{ into Eq. (2.54) and collecting terms yields 


uy, +u'(2y; + pyi) +u] + py, +¢y1) =0. (2.58) 


The last term in brackets yf + py, + qy1 =0 since yı is a solution. Thus, dividing 


D / 
Eq. (2.58) by yı and letting u’ = U (and u” = U’) gives U’ + (= + v) U =0. By 
yı 
separation of variables, In U = —21n yı — f pdx, which implies 
1 
U = eS Par, (2.59) 
Yi 
Therefore, 
y2 = Uy] = y1 f Udx. (2.60) 


Example 2.2.10. Given that the solution of y” + ay’ + by = 0 for the case of a double 
root is y} = e~“*/ (see Table 2.2), find the other solution. 


Solution. Here p(x) = a. Then in Eq. (2.59) one obtains — f pdx = —a f dx = —ax. 
Thus, U = e“ . e7% = 1 and u = f Udx = x. Therefore, y2 = uy) = xe~%*/”, as 
given in Table 2.2 (Case II). [answer] 


2.2.5 Nonhomogeneous equations 


Given the nonhomogeneous linear equation in Eq. (2.43) (with r(x) Æ 0) and the 
corresponding homogeneous equation in Eq. (2.44), a general solution of Eq. (2.43) 
is of the form 


Y = Yh + Yp, (2.61) 


where yp is a general solution of Eq. (2.44) and yp is any particular solution of 
Eq. (2.43). A particular solution of Eq. (2.43) is a solution obtained from Eq. (2.61) 
by assigning specific values of the arbitrary constants cı and c2 in ya (x). 


Example 2.2.11. Solve the initial value problem y” — 3y’ + 2y = 12e~?*, y(0) = 1, 
y (0) =-1. 


Solution. 
(i) General solution of homogeneous equations. 
The characteristic equation 22 —34 +2 = 0 has roots 1 and 2. Therefore, Yh = c1e* + 


cae™, 


a 
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(ii) Particular solution of nonhomogeneous equations. 

Try yp =C e~°* (see the method of undetermined coefficients in the next section). 
Then y, = 2C”, yY, = 4Ce~**, and substitution gives 4C¢~** — 3(—2C¢-**) + 
AO = ie Hence, 4C + 6C + 2C = 12, so that C = 1 and the general solu- 
tion of the nonhomogeneous equation is y = yy + yp =cye* + coe + 6%, 

(iii) Particular solution satisfying initial conditions. 

We have y'(x) = cye* + 2c2e* — 2e~**. The initial conditions yield 


y0)=cjy+ea+1=1, 
y (0) =c + 2c2 -2=-1, 


so that c4 = —1 and c2 = 1. Therefore, y(x) = —e* + e* + e~**. [answer] 


In summary, to solve the nonhomogeneous Eq. (2.43), or an initial value problem 
for Eq. (2.43), one solves the homogeneous Eq. (2.44) (as discussed in Section 2.2.2 
and Section 2.2.4) and finds a particular solution y, of Eq. (2.43). Methods for finding 
yp are detailed as follows. 


Method of undetermined coefficients 
This method applies to the equation with constant coefficients 


y” +ay' + by =r(x), (2.62) 


where r(x) is of a special form as detailed in the following rules: 


(a) Basic rule. If r(x) in Eq. (2.62) is one of the functions in the first column of Ta- 
ble 2.4, choose the corresponding function for yp and determine its undetermined 
coefficients by substitution into Eq. (2.62). 

(b) Modification rule. If the choice for yp is also a solution of the homogeneous 
equation corresponding to Eq. (2.62), then multiply the choice for yp by x (or x 
if this solution corresponds to a double root of the characteristic equation of the 
homogeneous equation). 

(c) Sum rule. If r(x) is a sum of functions in several lines of Table 2.4, then choose 
for yp the corresponding sum of functions. 


Example 2.2.12. (rule a) Solve y” + 5y = 25x?. 


Solution. Using Table 2.4, yp = Kax? + Kx + Ko, yielding Yp = 2 K2. Therefore, 
2Ko+ 5(Kox +Kıx+Ko)= 25x2. Equating coefficients on both sides gives 5K2 = 
25, 5K, =0, 2K2 + 5K, = 0, so that K2 = 5, Kı = 0, and K = —2. Therefore, 
Yp = 5x? — 2. Thus, the general solution is given by y = yh + Yp = Acos J5x + 
B sin /5x + 5x? — 2. [answer] 


Example 2.2.13. (rule b) Solve y” — y=e™*. 
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Table 2.4 Choice for the yp function. 


Term in r(x) Choice for yp 
keY* CeY* 


kx” (n=0, 1, ...) Kyx" + Ky—1x"! +... + Kix + Ko 


k cos wx 


f K cos œx + M sin œx 
ksinox 


ke cos wx 


ke sin wx 


| e** (K coswx + M sinwx) 


Solution. The characteristic equation (A? — 1) = (À — 1)(A+ 1) = 0 has the roots 1 
and —1, so that yp = c1e” + c2e™™. Normally Table 2.4 suggests yp = Ce~* (which 
is already a solution of the homogeneous equation). Thus, rule (b) applies, where 
Yp = Cxe™. As such, Yp = C(e-* — xe“) and Yp = C(—2e™* + xe*). Substitut- 
ing these expressions into the differential equation gives C(—2 + x) * — Cxg * = 
g *. Simplifying, one obtains C = —1/2. Thus, the general solution is given by 
y=cye* +c2e* — axe. [answer] 


Example 2.2.14. (rules b and c) Solve y” + 2y'+y =e * — x, y(0) = 0, y’ (0) = 0. 


Solution. The characteristic equation à? +2A + 1 = (à + 1)? = 0 has a double root 
à = —1. Hence, yp = (cy + c2x)e™™ (see Section 2.2.2). For the x-term on the right- 
hand side, Table 2.4 suggests the choice Kix + Ko. However, since à = —1 is a 
double root, rule b suggests the choice Cx?e™* (instead of Ce~*) so that Yp = Kixt+ 
Ko + Cx?e™. As such, Yp + 2y, + yp =2Ce* + Kix + (2Kı + Ko) =e™-—x, 
which implies C = 7 Kı =—1 and Ko =2. The general solution is y = ya + yp = 
(cy tcox)e7* + 5x7e* +2—-—x. 


For the initial conditions, y’ = (—c1 + c2 — cox)e* + (x — 5x7)e"* —1. 
Therefore, y(0) = cı +2 = 0 (so that cı = —2) and y’(0) = —cy + c2 — 1 = 0 (so that 
c2 = —1). Thus, y = —(2 + x)e™* + 5x7e7* + (2 — x). [answer] 


Method of variation of parameters 
This method is more general and applies to the equation 


y” + p(x)y’ + q(x)y =r(x). (2.63) 


The method gives a particular solution y, of Eq. (2.63): 


F yır 
w= | arty f y” , (2.64) 


OOOO. 
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where yı and y2 form a basis of solutions of the corresponding homogeneous equa- 
tion, 


y” + p(x)y’ +.4q(x)y =0, (2.65) 
and W is the Wronskian of y1, yo, 
W =y1¥2 — Yi92- (2.66) 


Example 2.2.15. Solve y” — y = sinh x. 


Solution. The basis solutions of the homogeneous equation are yı = sinhx and 
y2 = cosh x. Thus, W = sinh x sinh x — cosh x cosh x = —1. Hence, from Eq. (2.64) 
(choosing the constants of integration to be zero), 


1 
Yp = sinh x f cosh x sinh xdx — cosh x | sinh x sinh xd x = z7 cosh x — sinh x). 


The general solution is y = ya + yp = [c1 sinh x + c2 cosh x] + 5 (x cosh x — sinh x). 
[answer] 


Note: Had the two arbitrary constants of integration cy and —c2 been included, 
Eq. (2.64) would have yielded the additional general solution cı sinh x + c2 cosh x = 
c1y¥1 + c2y2 (which always results). 


2.3 Higher-order linear differential equations 
Homogeneous linear equations 
An ordinary differential equation of nth order is of the form 


F(x,y, y,..y™) =0, (2.67) 
d"y 
where the nth derivative y™ = ae The equation is linear if it can be written as 
x 
yO + Pn ay"? +.. + PL@)y’ + po@y =r). (2.68) 


The corresponding homogeneous equation written in “standard” form is 


y™ + pr Œy"? +... + pi(a)y’ + polx)y =0. (2.69) 


Eq. (2.69) has the important property that a linear combination of solutions is 
again a solution (superposition principle or linearity principle). A basis of solutions 


ee = = — 
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of Eq. (2.69) consists of n linearly independent solutions of y1, ..., Yn. The corre- 
sponding general solution is a linear combination: 


y(x) =c1y1 (x) +... + Cn yn(X) | (C1, ..., Cn are arbitrary constants). (2.70) 


From Eq. (2.70) one obtains a particular solution by choosing n numbers for c1, ...cy 
by imposing n initial conditions: 


y(xo) = Ko, y'(xo) = Kı, ..., y"—D (x9) = Kn_-1| (xo, Ko, ...Kn—1 are given). 

(2.71) 
Together Eq. (2.69) and Eq. (2.71) constitute an initial value problem for Eq. (2.69). 
If po, .--, Pa—1 are continuous on some open interval I and xo is in I, then Eq. (2.69) 
has a general solution on I, and Eq. (2.69) and Eq. (2.71) have a unique solution on I 
(which is a particular solution). 


Similarly, as in Section 2.2.4, the basis functions y1, ..., Yn are linearly indepen- 
dent on the interval I if and only if the Wronskian (extended to nth order), as defined 
by 

y1 y2 apa Yn 
yl ys a y! 
WO esyd 7) oe A (2.72) 
—1 —1 -1 
oo oe ie 


is different from zero on the interval. 


Homogeneous equations with constant coefficients 
In the case of constant coefficients, the nth-order homogeneous equation becomes 


y™ + ayy") +... Fay’ +doy =0. (2.73) 


Similarly, substituting y = e^ and its derivatives into Eq. (2.73), one obtains the 
characteristic equation 


4 Gea Se a +ao =0, (2.74) 


which has roots 41, A2...A,. Four cases must be considered: 


Case I: Roots all real and distinct. Then the n solutions are 


AIX Yn = e”, (2.75) 


y= e 


These functions constitute a basis, and the general solution of Eq. (2.73) is 


y= qe + H Cne”. (2.76) 


OOOO. 


30 CHAPTER 2 Ordinary differential equations 


Case If: Complex simple roots. Complex roots occur in conjugate pairs since the 
coefficients of Eq. (2.73) are real, that is, A = y +i@ and A = y — iw. Then a solution 
corresponding to the roots À and A is 


y=e’*(A;coswx + Bı sin œx). (2.77) 


Case III: Real multiple roots. If à; is a root of order (or multiplicity) m, then a solu- 
tion corresponding to the m linearly independent solutions is 


y = (c1 + eX +.063x7 +. + Cm xT Ye, (2.78) 


Case IV: Complex multiple roots. Given the complex conjugate pair à = y + iw and 
à =y — iw, a solution corresponding to the roots à and à of order m is 


y =(Aı cos œx + Bı sinwx)e”* + (A2 cos wx + Bz sinæx)xe”* +... (2.79) 


+ (Am cos@x + Bm sinox)x” le”, 


Example 2.3.1. (Case I) Solve y” — 6y” + lly’ — 6y =0. 


Solution. The roots of the characteristic equation 23 — 6474+ 11A-6=(A-DOA 
2)(A — 3) = 0 are Ay = 1, Az = 2, and A3 = 3. Using Eq. (2.76), the general solution 
is y =cye* + c2e°™ + c3e™. [answer] 


Example 2.3.2. (Case II) Solve y” — 4y” + 8y' =0. 


Solution. The characteristic equation is A3 — 4A? + 8A = O or A (à? — 4A +8) = 0, with 
roots A; = 0, Az = 2 + 2i, and Az = 2 — 2i. The general solution given by Eq. (2.76) 
and Eq. (2.77) is y = c1 + e** (A; cos2x + Bı sin2x). [answer] 


Example 2.3.3. (Case III) Solve yY + 6y!Y + 12y” + 8y” =0. 


Solution. The characteristic equation is 29+ 644 + 1203 + 87 = 0 or à? (A +2)? =0. 
This equation has roots 4; = A2 = 0 and A3 = A4 = A5 = —2. Therefore by Eq. (2.78), 
the general solution is y = c1 + cox + (c3 + c4x + c5x*)e~2*. [answer] 


Example 2.3.4. (Case IV) Solve y” —7y© 4 29y — 75y + 131y” — 149y” + 
95y’ — 25y =0. 


Solution. The characteristic equation is A — 72 + 29,0 — 75. + 13143 — 
149A? + 952 — 25 = 0 or (A — 1} (A? — 2A + 5) = (A — IPA — 1 + 2i)(À— 1 — 
2i)]? = 0. This equation has a triple root 1 and double roots 1 + 2i and 1 — 2i. Hence 
by Eq. (2.78) and Eq. (2.79), the general solution is y = (cy + cox + c3x?)e* + 
(A, cos2x + Bı sin2x)e* + x(A2cos2x + Bo sin2x)e*. [answer] 


ee) = = = — 
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Nonhomogeneous equations 
For the nth-order nonhomogeneous differential equation given by Eq. (2.68), the nth 
d” 
derivative is y™ = E and r(x) is continuous on I. Eq. (2.68) has no singular 
x 


solutions and a general solution is of the form 


y(X) = yax) + yp), (2.80) 


where yp is a general solution of the mth-order homogeneous linear differential equa- 
tion in Eq. (2.69) and yp is any particular solution of Eq. (2.68), which can be 
determined as follows. 


Method of undetermined coefficients 


Similar to Section 2.2.5 for second-order differential equations, this method gives yp 
for the constant coefficient equation 


y™ + any?) +... Fay! +agy=r(x). (2.81) 


In fact, the only difference from Section 2.2.5 concerns the modification rule since 
now the characteristic equation of the present homogeneous equation 


y™ + ap_-py"—) +... tay’ +acy =0 (2.82) 


can have multiple roots (that is, greater than simple or double roots). The basic rules 
are summarized as follows: 


(a) Basic rule as in Section 2.2.5 with Table 2.4. 

(b) Modification rule. If the choice for yp is also a solution of the homogeneous 
Eq. (2.82), then multiply yp(x) by x*, where k is the smallest positive integer 
such that no term of x*yp (x) is a solution of Eq. (2.82). 

(c) Sum rule as in Section 2.2.5 with Table 2.4. 


Example 2.3.5. (rule b) Solve y” + 6y” + 12y’ + 8y =6e7**. 


Solution. 

(i) The characteristic equation 34+ 6424+ 12A +8 = (A+ 2)? =O has a triple root 
à = —2. Hence by Section 2.3 (Case III), yp = cje7* + coxe~>* + €3x7e77*. 

Gi) With yp = Ce~*, one obtains —8C + 24C — 24C + 8C = 6 (that is, there is 
no solution). Therefore, by rule (b) choose yp = Cx3e-2*. As such, Yp = C(—2x? + 
3x7\e >, y= C (4x? — 12x? +6x)e 7”, and yy = C(—8x3 +36x? —36x+6)e~2*. 
Substituting these expressions into the differential equation gives (—8x? + 36x? — 
36x + 6)C + 6(4x? — 12x? + 6x)C + 12(—2x? + 3x7)C + 8x3C = 6. Simplifying, 
one obtains C = 1. Thus, the general solution is given by y = ya + Yp = (c1 402% + 
c3x2)e—2* + x3¢e72*. [answer] 


OOOO. 
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Method of variation of parameters 

This method can be applied to find a particular solution y, of the nth-order nonho- 
mogeneous linear differential equation in Eq. (2.68). The particular solution on some 
interval I is determined from 


W(x) Wr (x) 
Wa) r(x)dx +...+ Yn WO) 


Wi (x) 
yp@)=y | ——~r(x)dx + y2 r(x)dx. 


W(x) 


(2.83) 


Here y1, ...Yn is a basis of solutions of the homogeneous equation in Eq. (2.69) on I, 
with the Wronskian W, and W;(j = 1, ..., n) is obtained from W by replacing the jth 


column of W by the column [0 0 ... 0 1]. For example, when n = 2, W = 71 2 , 
z3 2 
0 yo yı 0 
Wi = =—y2, W2 = =y]. 
1o» yol 


Eq. (2.83) becomes identical with Eq. (2.64). 


2.4 Systems of differential equations 
Introduction of vectors and matrices 
In the analysis of linear systems of differential equations, that is, of the form 


= A11y1 +412y2 + ... + AInYn, 
= a21y1 + 422y2 +... + A2nYn, (2.84) 


yi 
y2 
Yh = Anyi + an2y2 +... + Ann Yn, 


one typically appeals to the use of matrices and vectors. A brief review of matrices is 
given in this section. The coefficients of Eq. (2.84) form an n x n matrix: 


aii 412 din 
a2) AN2 ~. An 

A= [ajk] = f i A (2.85) 
An] 4n2 Ann 


ee = = — 
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Similarly a row vector v is of the form 


v = [v1 U2... Vn]. 


Calculation with matrices and vectors 


Let A = Al le and B = bu bi , as well as v = vI and x = ee : 
an an ba, bn v2 x2 


ve aii + b11 ate En 
Addition: A + B = and v + x= . 
e +b2, an + b22 


E A | kay kar _ [kvi 
Scalar multiplication: ra= [fan a and kv= kip |" 


Matrix multiplication: The product C = AB of two n x n matrices A = [ajx] 
n 

and B = [bjx] is the n x n matrix C = [cję] with entries cj = D a jmXmk 
m=1 


with j = 1,.... and k = 1,...n. For example, AB = | i É i = 


—3 0j |-2 1 
2.-5+1.(—2) 2-34+1-1 _ | 8 7 
Eo (—2) (—3)-3+0- i E E —9 
multiplication is noncommutative). Similarly, v = Ax, where A is an n x n ma- 
trix and x is a vector with n components. Therefore, v is a vector with n components: 


n 
vj= 2 ajmXm, Where j = 1, ...n. For example, E 0| | = Ea 


l Note AB ~ BA (matrix 


Differentiation: The derivative of a matrix (vector) with variable components is 
yı(t) 
y2 (t) 


obtained by differentiating each component. For example, if y(t) = | 


. ri 
sinh 4t Taye yi) = 4cosh 4t l 
cost y5(t) —sint 


ai] 412 413 ai] 421 431 
Transposition: If A= | a2; an an |, then AT = |an an a32 |, and if v = 
431 432 433 413 23 433 


el then v? =[v v2]. 
v2 


Using matrix multiplication and differentiation, Eq. (2.84) can be written as 


7i all a2 .. ln yı 
X2 a a us a y2 
y = Ay, where y =| |, A= ah ae an! andy= 
! Anl 4n2 ++ Ann 


Yn Yn 


a 
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Linear independence: We say that r vectors v), ..., v) with n components are lin- 
early independent if c1 vO + Hev =0 implies that all scalars c1, ..., c, are zero 
(note that 0 denotes the zero vector). 


Inverse of a matrix: AA! = AIA =I , where J is the unit matrix (that is, the 
main diagonals with unit entries and all other entries are zero). If A has an in- 
verse, then it is called nonsingular; otherwise it is singular. For example (for n = 2), 


1 a —a ; . a a 
AT! = ae 12 , where the determinant of A is detA = * 12) = 
detA |—-421 411 ae a2) 422 
411422 — 412421. 
Eigenvalues and eigenvectors: Consider the equation 
Ax = ìx, (2.86) 


where A is a given matrix, À is a scalar (real or complex) to be determined, and x is 
a vector to be determined. A scalar à where x Æ 0 is called an eigenvalue of A, and 
this vector x is called an eigenvector of A. Eq. (2.86) can be equivalently written as 


(A—ADx =0, (2.87) 


which corresponds to n linear equations in the n unknowns x1, ..., Xn. For example, 
forn = 2, 
ai-A an xı 0 
= 2. 
[wa eataa lol a 


(aii — à)xı + a12x2 = 0, 


or in components, 
(2.89) 
a2ıx1ı + (a22 — A)x2 = 0. 


For a solution to Eq. (2.87), A — AJ must be singular, which results if and only if 
det(A — AJ) (that is, the characteristic determinant of A) is zero: 


aj — À a 
det(A — AI) = a iz L a| = (aut — A) (a22 — A) — a12021 (2.90) 


=)? — (ay tam) + ayyaz2 — ay2a21 = 0. 


This quadratic equation is called the characteristic equation of A. Its solution are the 
eigenvalues 4; and A. The process of solution is the following: 


(i) Determine A; and A2 from Eq. (2.90). 
(ii) From Eq. (2.89) with 4 = Ay, determine the eigenvector x“), and then with 4 = 
42, the second eigenvector x2), 


Note that if x is an eigenvector of A, then so is kx (for k #0). 


ee) = = 
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Example 2.4.1. Find the eigenvalues and eigenvectors of A = F k I 
Solution. 


(i) The characteristic equation is det(A — AJ) = ie =a : 


A) —2=A(A + 3) = 0. This yields 4; = 0 and Az = —3. For A = A, in the first 
equation of Eq. (2.89), —2x; + x2 = 0, which gives xı = 1 and x2 = 2. Also from the 
second equation, the result is 2x; — x2 = 0 (however, this latter equation gives the 
same result). 


Hence, an eigenvector to Ay = 0 is x“) = pi Similarly, x® = Él is an eigen- 


vector to Az = —3. [answer] 


2.4.1 Basic concepts and theory 


Many problems of engineering lead to systems of differential equations. Of central 
interest are first-order systems, 


y = Silt, Vi, -> Yn), 


= Pt, Y1,- Yn), (2.91) 
y = falt, Yi, +) Yn). 


In addition, an nth-order differential equation 
y™® = F(t, y, y’, YTP) (2.92) 


can always be reduced to a system of first-order differential equations by setting 


NHS) y3 = Y", e Yn = yP. (2.93) 
Therefore, 
Yi = V2, Vy = V3.) Yn) = Yn, and, from Eq. (2.92), y, = F(t, y1, y2---). (2.94) 


Extending the notion of a system of equations, Eq. (2.91) is a linear system if it can 
be written as 


Y= any +..+ainOyn + 81 (t), 


(2.95) 


yi = anı(t)yi +... + ann (t)Yn + gn(t), 


or in vector form, 


y =Ay+g, (2.96) 


OOOO. 
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ai) a2... Ain yı £l 
y2 8&2 
a a dis a . 
where A = | “7! at ,y=] . |,andg= 
Ani Gn2 ++» Ann Yn Zn 


If g = 0, the system is called homogeneous, that is, 
y' = Ay; (2.97) 
otherwise it is nonhomogeneous. 


The general theory of linear systems is similar to that of a single linear equa- 
tion (see Section 2.2.1, Section 2.2.4, and Section 2.2.5). A basis of solutions of 
Eq. (2.96) on some interval I is a linearly independent set of n solutions y, ..., y. 
A general solution of Eq. (2.96) consists of the corresponding linear combination: 


y= cy) tot cay (C1, .--, Cn are arbitrary constants). (2.98) 
The Wronskian of y®, ee y™ is defined as 


1 2 
yD ya 2. 9 


yı 
D (n) 
WO, y2 WD Ma | (2.99) 
1 2 
ge yP see gl 


The Wronskian of two solutions y and z of a single second-order equation is (see 
Eq. (2.56) of Section 2.2.4) 


y FA 
W(y,z)= y af (2.100) 


Using Eq. (2.56) and writing the equation as a first-order system, yj = y, y2 = y’, 
zı =z, and z2 =z’, W (y, z) takes the form of Eq. (2.99) with n = 2. 


2.4.2 Homogeneous linear systems with constant coefficients 


Consider the homogeneous linear system: 


y'= Ay, (2.101) 


where A has constant n x n entries. Substituting into Eq. (2.101) 


y=xe™ (2.102) 


re = = — 
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and dividing by e% yields the eigenvalue problem 


Ax = hx. (2.103) 


If the constant matrix A in Eq. (2.101) has a linearly independent set of n eigenvectors 
x, ...,x, corresponding to eigenvalues A1, ..., An, the general solution is 


y= cx Dei t.. exh ee., (2.104) 


—2yı + yo 


Example 2.4.2. Solve the system”) 
= ie y2 


and y2 (0) = 


with initial conditions yı (0) = 3 


Solution. 
From Example 2.4.1 for this system, the eigenvalues are 4; = 0 and Az = —3. The 
corresponding eigenvectors are, respectively, xT =[1 2]andx@7=[1 —1]. 


Hence, the general solution is y = cyy) + coy?) =] H +c [5] et, 
For the initial conditions y(0)? = [3 OJ], one obtains y(0) = c1 |, | +c) Ee = 


3 . Thus, Cera , which yields cj = 1 and c2 = 2. 
2c —c2 =0 


Therefore, the particular solution is y = $ +2 | j Ia r’! t F = 
= "y= = e 


[answer] 


Example 2.4.3. Consider the chemical kinetics problem in Fig. 2.3 with coupled for- 
ward and backward rate equations of first order for a closed system. In this process, 
there is desorption from a surface into the surrounding atmosphere with a back de- 
position onto the surface. The first-order rate constants are kj = 2 x 107? s~! and 
ky = 1 x 107°? s~!. The inventories on the surface, Nj (t) (mg), and in the surround- 
ing atmosphere, N2(t) (mg), are obtained from the following mass balance equations 
for the system: 


dN 
<a = ky No — ky Ni, 
dt 
dN2 
— =k, Ni- k Np. 
dt 


Initially, there is 300 mg on the surface and no inventory in the atmosphere, so that 
this system of equations is subject to the initial conditions N1 (0) = 300 mg and 
N2(0) = 0 mg. 


(a) Solve this homogeneous linear system for Nj (t) and N2(t) (this system of equa- 
tions is also solved by Laplace transform methods as in Problem 3.14). 


OOOO. 
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FIGURE 2.3 


Diagram of the chemical kinetics system. 


(b) What is the equilibrium value for N; and N2? Explain the physical significance 
of this result. 


Solution. 
Ni = —.02Nı + .O1N2 


, with initial conditions 
N3 = .02N;ı — .01 M2 


(a) The system can be written as 


N (0) = 300 and M2 (0) = 0, or N’ = AN so that H = E el “H and 
N(0)7 = [300 0]. 
—.02— à .01 

u =ð- i eee 
A)(.01 + A) — (.02)(.01) = A(A + .03) = 0. The eigenvalues are 4; = 0 and A2 = 
—.03. Eigenvectors are obtained from —(.02+A)x; + .01x2 = 0. For A; = 0, this sub- 
stitution gives —.02x; + .01x2 = 0, which implies x2 = 2x1. One can therefore take 
xT —[1 2]. For A2 = —.03, this substitution gives —(.02 — .03)x; + .01x2 = 0, 
which implies xı = —x2, and x®T =[1 —1]. 
Hence, the general solution is 


IMN] vO O |l 1 | o3 
w=[Nl=an +oaN”’=c1 2 +02 = e 3 


The characteristic equation is det(A — AJ) = 


> 


Using the initial conditions N(O) = cı B + ol 


which yields cı = 100 and c2 = 200. 
Therefore, the particular solution is 


= —.03t 
N = 100 B + 200] 1 | go! or Ni= 100 (1 Pea ) ; [answer] 
2 1 N2 = 200 (1 — e~-%*) 


1 | — (300|) c; +c2=300 
—-1} | OP 2c;-c2=0 


This physical problem is analogous to the system of equations solved in Exam- 
ple 2.4.2. 


(b) At equilibrium as t + oo, Nj = 100 and Nz = 200. A double inventory results in 
the atmosphere compared to the surface because the rate constant kı = 2k (twice as 
much material per unit time is leaving the surface than returning to it). [answer] 


ee = = 
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No basis of eigenvectors available 
Suppose jz is a double eigenvalue of A (that is, det(A — AJ) has a factor (A — wW’), 


for which there is only one eigenvector x. Hence, only one solution is implied: y® = 
xe, A second solution of Eq. (2.101) can be obtained by substituting 


y® = xte! + uel (2.105) 


into Eq. (2.101), yielding y®” = xe”! t+yxte! + uue! = Ay? = AxteM + AueM. 
Since ux = Ax, two terms cancel and dividing by e“ gives x + zu = Au. Thus, one 


obtains 
(A-—pwlu=x. (2.106) 
Example 2.4.4. Solve y’ = Ay = L l y. 
Solution. 
(i) The characteristic equation is det(A — AJ) = ee 1 i as 2? — 4 +4= 


(à — 2)* = 0. This equation has a double root A = 2. Eigenvectors are obtained from 
(3 — A)x} + x2 = 0. For A = 2, this implies x; + x2 = 0. Thus, taking xT = 

: = 1 1 _ —1 ui +u2 =-—1 
[-1 1], Eq. (2.106) gives (A — 2I )u = E a u = | I | Thus, irea 
and one can take u” =[—1 0]. Therefore, the general solution is y = c1y® + 


oy? =c | e +o (fm t+ E) e”, [answer] 


If A has a triple eigenvalue u and only a single linearly independent eigenvector 
x corresponding to it, one obtains a second solution Eq. (2.105) with u satisfying 
Eq. (2.106) and a third solution of the form 


1 
yO= satel + ute“ + ve", (2.107) 


where v is determined from 


(A—plv=u. (2.108) 


If A has a triple eigenvalue and two linearly independent eigenvectors x“) and x), 
then three linearly independent solutions are 


yD =xDVeHt yO = Oeri, y® = xte + ue", (2.109) 


where x is a linear combination of x“ and x® such that 


(A—plu=x (2.110) 


is solvable for u. 


OOOO. 
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2.4.3 Nonhomogeneous linear systems 


For the nonhomogeneous linear system 


y =Ay+g, (2.111) 


the general solution in which the n x n matrix A(t) and g(t) are continuous on some 
interval J is given by 


yay + y, (2.112) 


where y™ (t) is a general solution of the homogeneous system in Eq. (2.101) and 
yP)(t) is a particular solution of Eq. (2.111) (that is, a solution containing arbitrary 
constants). The particular solution is obtained as follows. 


Method of undetermined coefficients 


This method is applicable if the components of g are integer powers of t, exponential 
functions, or sine and cosine functions. 


Example 2.4.5. Solve 


1 —4 517 
‘=Ay+g= + : 2.113 
y YTE | 2 3 E E | ( ) 


Solution. 
The form of g suggests yP? = u + vt + wt?. Therefore, by substitution into 
Eq. (2.113), yo = v + 2wt = Au + Avt + Awt? + g. In components, one has 


v1 2wı| _ | ui —4u2 vı — 4v2 w — 4w2 2 512 
i + Pd = E + A T E + A g Ee + A Ea E —6/ 


e Equating t? terms: 0 = w; — 4w2 + 5, 0 = —2w1 + 3w2. These relations yield 
w1 = 3 and w2 = 2. 

e Equating t terms: 2w; = vı — 4v2, 2w2 = —2v1 + 3v2 + 6. These relations yield 
vı = —2 and v2 = —2. 

e Equating constant terms: vı = u — 4u2, v2 = —2u1 + 3u2 — 6. These relations 
yield uj = —2 and u2 =Q. 


From these results and the general solution of the homogeneous system in Sec- 
tion 2.4.2, the general solution is 


2 
MPa S| at 1] 5 |3 -2-2 
y=y +y c1 H e +c E e + l TY : [answer] 


CC... ll SUE 
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A modification is required if a term g involves eĉ, where A is an eigenvalue of A. 
Then instead of assuming yP) = we”, one lets 


yP = ute™ + ver. (2.114) 


Example 2.4.6. Solve 
i _|-2 1 —1| 3 
y=ay+e=[ Abell. l (2.115) 


Solution. 
From Section 2.4.2, the general solution of the homogeneous system is y™ = 


Cl h +c [3] e7’, Since 4 = —3 is an eigenvalue of A, using Eq. (2.114), 


yP) = ute! + ve~*, and substituting this equation into Eq. (2.115) yields y?” = 
ue™’t — 3ute—* — 3ve—* = Aute—* + Ave~* + g. Equating the te~~! terms im- 
plies —3u = Au (that is, u is an eigenvector of A with eigenvalue à = —3). Therefore, 
uT =a[l — 1]. Equating the other terms gives 


u-3v=av+|7)] or (A+3pu=u-[75]=a +f]. 


vı +v =a+1 
2v; + 2v = —a+2’ 
1 — vy. One can choose v? = [0 1]. The general solution is 


a 1 1 —3t 0 —3t 
y=C} Biz le + HE 2 [answer] 


Method of variation of parameters 
This method is applicable to the nonhomogeneous linear system 


Alternatively, in components, implying a = 0. Therefore, v2 = 


y =A(t)y + g(t), (2.116) 


with variable A = A(t) and general g(t). The general solution of the homogeneous 
system 


yP = ey H cny (2.117) 


can be written in components 


y = : = ; : | =Y (tc, (2.118) 


1 1 
cyh H + ony? yP -yP | Len 


OOOO. 
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where Y(t) is the matrix with the columns yD oes y”, the vectors of the basis in 
Eq. (2.117), and cD) = [c1...Cn] is constant. Thus Y obeys the homogeneous matrix 
equation 


Y'= AY. (2.119) 
Letting 
yP =Y (tult) (2.120) 
and substituting this equation into Eq. (2.116) yields 
Yut+Yu' = AYu +g. (2.121) 
Thus, using Eq. (2.119) in Eq. (2.121) gives 
Yu’ =g oru’ =Y~'g. (2.122) 


Integrating Eq. (2.122) 
t 
u(t) = I Y'@g(fdi+C. (2.123) 
to 
Equivalently, the general solution is 


t 
y=Yu=Yc+y | Y~'@)g(f)di. (2.124) 


to 


The particular solution y) is obtained by setting C = 0, that is, 


t 
y?) a Yedi. (2.125) 
to 


Example 2.4.7. For the system in Example 2.4.6, 


h) 1 et 
y =c B toj al (2.126) 
Solution. 
We have, Y=[y) y®]= Ea and g = =H 3 
i a =2 ` 
1 = 1 Cot p ii 
From Section 2.4, Y7! = — | >? »2| _ & 6 = 
det Y |-y21 yu —3e—3t 2 1 
if1 1 a ifi., iJa! oo 
3 ret et |" Therefore, Y~ g = 3 | 203! a || apt = . Inte- 
d ( 


t —e73t N 1 et _ 1) 
grating and choosing C = 0 yields u(t) = | 0 t=| 3 : 
) 


ee) = = — 
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Thus, 


1 1 
The last term can be rewritten as 5 B e7?! = (5 | l | + |) e`’. Hence, the 


general solution is 


y= y As yP = rs hl + ch | A et + H e”, [answer] 


Here the constants have been incorporated into the homogeneous solution of Exam- 
ple 2.4.2, such that c} = cı — 1/3 and c} = c2 + 1/3. This general solution is in 
agreement with Example 2.4.6. 


Method of diagonalization 
This method can be applied to the system 


y'= Ay + g(t), (2.127) 


for which A has a basis of vectors x“), ..., x. It can be shown that then 


D=X7—'!Ax (2.128) 


is a diagonal matrix with eigenvalues A“), ..., 4 of A on the main diagonal. There- 
fore, X isann x n matrix with columns x), ..., x. Note that XT! exists because 
the columns are linearly independent. Thus, for the eigenvectors in Example 2.4.6, 


1 1 Zod —2 1 1 1 0 0 
rD _ = 
X=[x%x i=l Jao; Alle aE eh I 
3 3 


Define z = X~!y and y = Xz, and substituting this into Eq. (2.127) (where X is a 
constant), one obtains 


Xz =AXz +8. (2.129) 
Multiplying Eq. (2.129) on the left by XT! gives 


7’ =X !AXz+h, (2.130) 


where h = X~!g. Using Eq. (2.128), one obtains z’ = Dz + h or in components 
zi =Aj2; + hj, where j = 1, ..., n. Each of these “decoupled” n linear equations can 


a 
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be solved as (see Section 2.1) 


Zp See test feet hj(t) dt. (2.131) 


These quantities are the components of z(t), and therefore y = Xz. 


1f1 1]| -e-* 
E a E = 
Example 2.4.8. For Example 2.4.6, one has h = X~ g = 3 E ie) | 
1 | —3e7?t —e3t 
3 0 = 0 . Since the eigenvalues are 4; = 0 and Az = —3, the diag- 


/ 
onalized system is z’ = S + h. Thus, ki 
0 —3 A = 


—3t 
$ e = 2 , 
solutions are z1 = c1 + > and z2 = c2e~*!. The general solution is 


yexe=|> ‘| + |_| atg Hee" 
2 -l1 ce 3! 2c) + 2a =tge 

1 1 ee poe : p 
Hence, y = cy +c ery 3\2 e`”. The last term can again be rewritten 


11i] -az ee ee 0 3t E 
slale ~A3]-1 + 1| pe >52 that the general solution is 


= 1 1 —3t O| -3 
yHcl p +c [fe +l. : [answer]. 


This result is again identical to Example 2.4.6 and Example 2.4.7. 


e73 
. From Eq. (2.131), the 
3z2 


2.5 Series solutions and special functions 


The power series method is the standard method for solving linear differential equa- 
tions with variable coefficients as 


y” + p(x)y’ +q(x)y =0. (2.132) 


A power series is of the form 


CO 


DE ain (x — xo)” = ao + a1 (x — x0) + a2(x — x0)? + ~», (2.133) 


m=0 


ee O 
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where do, a1, a2... are the constant coefficients of the series, xg is a constant (called 
the center of the series), and x is a variable. Letting x9 = 0, consider a series to 
Eq. (2.133) of the form 


CO 
y= X amx" = ao + aix + ax? + a3x°... (2.134) 
m=0 
with 
CO 
y= re =a; + 2anx + 3a3x7 +..., (2.135) 
m=0 
[0,6] 
y= > m(m — 1)amx""~? = 2a. + 3-203 + 4 - 3a4x? +... (2.136) 
m=0 


Example 2.5.1. Solve y” + y =0. 
CO 


[0,0] 
Solution. Letting y = X anx", the derivatives are y’ = X nax"! and y” = 


n=0 n=1 
CO 


Sinn =a", Substituting these expressions into the differential equation 
n=2 


OO CO 
yields x n(n — l)ayx"* + >. anx” = 0. Relabeling so that both expressions have 


n=2 n=0 
the same power of x (where n > n + 2) gives 


[(n + 2)(n + l)an42 + ay] x" =0. 


E 


0 


n 


Equating each coefficient to zero gives 


(n=0) 2- la =—ay > a =-F, m=) 3-2a3=-a1 > a3 =—F, 


ao ai 
m=) 4-34 =-m > u=F, (n=3) S-4as=-a3 > as =F, 
(n=4) 6-Sas=—as > ag=-5 | (n=5) 7-6as=-a3; > m=-2 

Th = d =i The solution is theref 
us = an = — 3 e solution 1S eretore 
> A2k (2k)! ao a2k+1 (2k + iy? 
SC DE S =t 2k+1 
yœ) = a (2k) “aid, Qk+ D!” 


= dog COS x + a), Sin x. 


The analytical result is also easily obtained from Section 2.2.2. 


an 
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2.5.1 Legendre equation 


Consider Legendre’s differential equation: 


(1 —x*)y" —2xy' +n(n + 1)y=0. (2.137) 


Assuming a power series solution 
[00] 
y=) amx” (2.138) 
m=0 
and substituting this equation and its derivative into Eq. (2.137) yields 
[0,0] OO [0,6] 
y= (1—x?) X mm- lax"? = 2x X mamx” 7! +k Y amx” =0, (2.139) 


m=2 m=1 m=0 


where k = n(n + 1). Thus, expanding 


[0,6] [0,6] CO [0,6] 
y= b> m(m — 1)amx"? — 5 m(m— 1)amx” —2 > mamx” +k 5 amx” =0, 
m=2 m=2 m=1 m= 
(2.140) 
or written out, 
2-la) +3-2a3x +4-3a4x? +- +(s +2)(s + 1)as42x5 +- 
—2. lax? — —s(s — l)asx* — 
—2. lax —2-2ax? —--- —2sayx* = 
kao +ka,x +kagx* +.-- tkasx' +++. =0. 


Equating the coefficients of each power of x to zero, 


xl: 2a2 +n(n + l)aọ = 0, 

x! : 6a3 +[—2+n(n + lla =0, 

xt (s =2,3...): (s +2)(s + Das+2 + [-5(s — 1)— 2s +n(n + 1)]as =0. 
=(n—s)(n+s+1) 


Therefore, 


(n—s)(n+s+4+1) 
(s +2)(s +1) 


as42 = as  (s=0,1,...). (2.141) 


EEE 


2.5 Series solutions and special functions 47 


From the recurrence relation in Eq. (2.141) one obtains 


n(n+ 1) (n — 1)(n +2) 
a2 = ~~, 0, a z hb 
a oars), j re Sead a 
4 = 4.3 25 s= 5.4 3; 
(n — 2)n(n+ 1)(n+ 3) (n — 3)(n—1)(n+2)(n +4) 
= 4! SE 5! í 


Inserting the values for the coefficients into Eq. (2.138), 


y(x) =aoyi (x) + a1y2(x), (2.142) 
where 
1 —2 1 3 
yi(x) = 1 nat lo mat Mes ) A R 
= (n — 1)(n+2) oa (n—3)(n—1)(n+2)(n+4) 5 
yo(x) =x 7 a 


Hence, Eq. (2.142) is a general solution of Eq. (2.137) for —1 < x < 1. 


Legendre polynomials 
If n in Eq. (2.137) is a nonnegative integer, then the right-hand side of Eq. (2.141) is 
zero when s = n, that is, dy42 = an+4 = An46 = ... = 0. If n is even, yı (x) reduces to 


a polynomial of degree n. If n is odd, the same is true for y2 (x). These polynomials 
multiplied by some constants are called Legendre polynomials. 


(s+2(s+1) 
From Eq. (2.141), as = <n—2) 50 that a= 
rom Eq. ( ), as a-si ds42 (s <n -— 2) so that an—2 
n(n — 1) 2n 


-n 2n D Choosing an = a(n ane so that the polynomial will have a value of 
unity when x = 1 gives 
_ n(n — 1)(2n)! 
an-2 = n — 12" (nly? 
_ A — 1)Zn (24 — 1)(2n — 2)! _ (2n — 2)! 
~ B(2fk — 1)2" 1)! 1 2)! "(n—- Din — 2)! 
Z (2h — 1) pn — D'A — 1)(n — 2) ( D! ) 
n! 
a (n —2)(n—3) (2n — 4)! 
Similarly, an—4 = Fen 3 |" = PIa- DnP! In general, when 
n— 2m 20, 
(2n — 2m)! 
An—2m = (—1)” À (2.143) 


2”m!(n — m)!(n — 2m)! 


OOOO. 
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From Eq. (2.143), the solution of Eq. (2.137) is the Legendre polynomial of degree 
n: 


M 
Pa(x) = D5 CD” o aaa (2.144) 


2'mi(n — m)l(n — 2m)!” 


m=0 


n—1 


n 
where M = 7 or (whichever is an integer). For example, 


Po(x) = 1, P\(x) =x, 


P(x) = : (3x? -1), P3(x) = ; (5x3 — 3x). 


2.5.2 Frobenius method 


The differential equation 


b 
ae O ey Le) (2.145) 
Xx XxX 


where b(x) and c(x) are analytic at x = 0, can be solved by a Frobenius (or extended 
power series) method. One solution exists for Eq. (2.145) of the form 


lo) 
y=x" 5 amx" =x" (a +ayx tanx7 + asx*...) : (2.146) 


m=0 


where r may be real or complex (and is chosen so that ap 4 0). To solve Eq. (2.145), 
this equation can be rewritten as 


n 


xy" + xb(x)y’ + c(x)y =0, (2.147) 


where b(x) and c(x) are expanded in a power series: b(x) = bo + bix + box? +... 
and c(x) =co+cix + c2x? +... Inserting the derivatives of Eq. (2.146), 


[0,6] 


y (x) = $ (mt ranx™ t! = x7! [rao + (r + Daix +], 
m=0 
CO 


y"(x) = Sim +r)(m +r — 1)amx” t"? 


m=0 


=x"? [r(r — 1)aọ + (r + Drax +...], 
into Eq. (2.147) yields 


x"Ir(r — lag + ...] + (bo + bix +...)x" (rao +...) 


7 (2.148) 
+ (co + cix + ...)x' (ag + aix + ...) = 0. 


se O 
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Equating the coefficients of the smallest power of x (x") to zero yields the quadratic 
indicial equation: 


r(r —1) + bor + co =0. (2.149) 


Thus, one of the two solutions will always be in the form of Eq. (2.146), where r 
is a root of Eq. (2.149). The form of the other solution depends on the root yielding 
three possible cases: 

Case I: Distinct roots not differing from an integer; 
yi(x) = x"! (ao + aix + ax? +...) F (2.150a) 
ya) =x"? (Ao + Aix + Ane? + g (2.150b) 


where the coefficients are obtained successively from Eq. (2.148) with r = rı and 
r=". 


Case II: Double root rı =r2 =r; 


yi) =x (a 4 ayx tanx? + ~) [ = ta z w|, (2.151a) 

yo(x) = yı (x) Inx + x” (Aix + A2x? +...) [x > 0], (2.151b) 
Case III: Roots differing by an integer; 

y) =x"! (ao + aix +apx? +...), (2.152a) 

yo(x) = ky (x) Inx + x” (Ao + Ax + Ax? + i) , (2.152b) 


where rı — r2 > 0 and k may be zero. 


Example 2.5.2. (Case I) Solve 2xy” + y! + y =0. 


Solution. Substitute Eq. (2.146) and its derivatives into this differential equation, ob- 


taining 
[0,6] [0,6] [0,6] 
2 Xoom +r)(m +r = Taya + Xom +r)amx” t" + > anx™* =0. 
m=0 m=0 m=0 


(2.153) 

Equating the sum of the coefficients of x”~! (that is, the smallest power of x) to 
zero yields the indicial equation 2r(r — 1) + r = 0, which implies that 2r? — r = 0. 
Hence, rı = 5 and r2 = 0. Equating the sum of the coefficients of x”** to zero (that 
is, taking m +r — 1 = r + s, which implies m = s + 1 in the first two series and m = s 


OOOO. 
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in the last series), 2(s +r + 1)(s +r)as+1 + (s +r + 1)as+1 +as = 0, or simplifying, 
(s +r + 1)(2s + 2r + las41 + as = 0. This latter relation implies 
as 


= 1, 2.154 
eae |S ) len 


ds 


—————— and thus 
(2s + 3)(s +1) 


(i) First solution: For r = $, Eq. (2.154) gives ds41 = 


ao a a2 b . bstituti 
= 0 = , 43 = , or by successive substitution, aj = ———, 
ay 3.1 a2 5.2 a3 73 y ai J] 
a = E pL =A aT or in general (with ag = 1), am = 
(=1)” 2" 
———.(m =0, 1, ...). Therefore, 
(2m + 1)! 
[0.0] [0.0] 
—1)” 2” 1 (—1)™ (2x) 2m+b/2 1. 
1/2 ( a 7 UE 
KJER x = = sin(v 2x). 
y1@) 2 (2m +1)! TÈ (m+! Gao 
[answer] 
“a l As 
(ii) Second solution: For r = 0, Eq. (2.154) gives As+1 = (s= 


(+ D254) 


Gi adie pe i, EE E E 
’ a se 1 = 1-1 2. = pe i 3 = 3.5° 1= 1- i 
A a ee ee | 1 (with Ap = 1), A ones 
= , =— . In general (wi = 1), = 
[= (a? Ogee "8 0 m= (Om)! 
0O [0,6] 
(=1)” 2" (—1)”( 2x)" 
Therefore, y2 (x) = D eoad = Ze ao = cos(v 2x). [answer] 


Hence, the general solution is y = c1 sin(./2x) + c2 cos(./2x), which is in agreement 
with the form of Eq. (2.150). 


1 
Example 2.5.3. (Case II) Solve x7y" — (2x?) y’ + < + z) y=0. 


Solution. Substituting Eq. (2.146) and its derivatives, 


foe) CO [0,6] 
= (m+r)\(m+r—1)anx™*" —2 5 (m +r)amx” ttl + nS dager 
m=0 m=0 m=0 


1 [0,6] 
+ 4 X amx” =0. 


m=0 


Equating the sum of the coefficients of the smallest power x"~! to zero yields [r (r — 
1) + 1/4] = 0, which implies that rı = r2 = 1/2 (there is a double root). 


(i) First solution: Equating the sum of coefficients of x”™s to zero with m = s in the 
first and fourth terms, m = s — 1 in the second term, and m = s — 2 in the third term, 


se — Eee 
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(s+tr)(s+r— lay — 2(s +r — 1)ds_1 + as—_2 + las =0or[(s+r)(s+r—1)+ 
la, — 2(s +r — l)as—1 + as_2 = 0. For r = 1/2, this expression reduces to s?as = 


g 
2s — l)as—1 — as— 
2(s — 5)As—1 — ds—2. For s = 1, aj = ao and for s > 2, as = 2 Lt IT 
sS 
3a, — ag ao Sa2 =j 5ao/2 — ag 3ao ao 
Hence, a2 = = —, 43 = = = = —, and a4 = 
n 4 2! 9 9 18 3! 
m and so on. By choosing ap = 1, 
2 3 4 
Ke < x x 
o= [ia T + F + F + Ai +. =e. [answer] 


(ii) Second solution: Using the method of reduction of order (Section 2.2.4), y2 = 


1 
yı f Uax, where U = en Spas for the differential equation y” + p(x)y’ + 


X1 
1 
q(x)y = 0. Thus, for the equation y” — 2y’ + (1 + =) y =0, one has p = —2 
ex 1 1 
and — | (—2)dx = 2x. Hence, U = —- = — and yp(x) = ve f tax = 
xer* x x 


x!/?e* (Inx +c). [answer] 


Thus, the first and second solutions are in agreement with the form of Eq. (2.151). 
The general solution is therefore y(x) = x!/e* [c1 + c21nx]. 


Example 2.5.4. (Case III) Solve xy” + 2y’+ xy =0. 


Solution. Substituting Eq. (2.146) and its derivatives, 


(oe) (oe) [0,6] 
> (m+r)(m+r — ljapx” t"! +2 > (m +r)amx” t"! + Xo amx” "tl — 0, 
m=0 m=0 m=0 


or simplifying, 


(oe) CO 
Xon +r)(m +r + lap x” "T! 4+ me amx” t+! = 0, 
m=0 m=0 


The lowest power of x implies the indicial equation is r(r + 1) = 0. The roots are 
rı = 0 and r2 = —1, which differ by an integer. 


Set m = s + 1 in the first series and m = s — 1 in the second series so that 
(s+r++1)(s +r +2)as+ı +as_) =0. (2.155) 


(i) First solution: 


an 
52 CHAPTER 2 Ordinary differential equations 


Inserting r = 0 in Eq. (2.155), (s + 1)(s + 2)ds41 + as_; = 0, yields as4) = 


i! forbs Dadar fots = 0. Hence arz 0, Se A 
(s+1)(s +2) T 2-3 3! 
a = 0, a O aA and go on Bente wig = cjr” and 

eke (eee T ee i a Om + D! 


a2m+1 = 0. Taking ap = 1 yields 


xo x 1 x x sin x 
yi(x) = beget ae = E x a + 5 ofS < [answer] 


(ii) Second solution: Using the method of reduction of order (Section 2.2.4), y2 = 


1 
yı f Uax, where U = =e Spas for the differential equation y” + p(x)y’ + 


yı 
2 
q(x)y = 0. Thus, for the equation y” + — y’ + y, one has p = 2/x and — fom dx= 
x 
2 x? 1 1 sinx dx sin x cot x 
Inx 4. Hence, U = ——— -z = —~ and y2(x) = == = 
sin’ x x sinf x x sinf x x 
cos x 
. [answer] 
x 
One obtains the same result using the second root of the indicial equation, r = —1 in 
Eq. (2.155). 
Thus, for this case k = 0 in Eq. (2.152) and the general solution is given by y(x) = 
sin x cos x 
cı — 
x 


Example 2.5.5. Sometimes a differential equation can be transformed into a simpler 
one using a suitable transformation. For instance, solve x*y” + xy’ + 4y =0. 


Solution. 


dx : : . n : 
Let x = e*, so that ae = e*. Hence, using this transformation with the chain rule, 
Z 


d 
the first and second terms of the equation can be rewritten. Given that y’ = = = 
x 
dy dz d 
st A e LA the second term becomes 
dz dx dz 


ee = 
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so that 
2 d 2y dy 
x y =| —5 -> }. 
dz? dz 
Hence, the differential equation is transformed into a linear differential equation with 


constant coefficients: oo + 4y = 0. The solution to this latter differential equation 


is y= Acos2z+ B sin 2z. Substituting in z = ln x, the final solution is 


y = Acos(2Inx) + Bsin(2Inx). [answer] 


2.5.3 Bessel’s equation 


The Frobenius method can be used to solve Bessel’s differential equation: 


xy" tay + a? —2 jy =D, (2.156) 


or in standard form, 


n 1 1 v? 


Substituting a series of the form 


[0,6] 
y= Y amx” (2.158) 


m=0 


and its derivatives into Eq. (2.156) gives 


[0,6] CO [0,6] 
5 (m+r)(m +r — 1l)amx” t" + > (m +r)amx”™" + > amx” ttt? 
m=0 m=0 m=0 
[0,6] 
-y > amx” ™" =0. 
m=0 


Thus, letting m = s for the first, second, and fourth series and m = s — 2 for the third 


yields 
r(r — l)ao + rao — v?ao =0 (s=0), (2.159a) 
(r + l)rai + (r + Day — vay =0 (s=1), (2.159b) 


(s+r)(s+r— las +(s+r)as + ds—2 — vas =O (s=2,3,...). (2.159c) 
From Eq. (2.159a), the indicial equation is 


(r+v)(r—v)=0 (2.160) 


OOOO. 
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with roots ry = v and r = —v. For r = rı = v, Eq. (2.159b) yields a; = 0 and 
Eq. (2.159c) may be written 


(s +2v)sas + ds_2 = 0. (2.161) 
Since a; = 0 (and v > 0), Eq. (2.161) implies a3 = 0, a5 = 0. Setting s = 2m in 
1 


Eq. (2.161), dam = —-=>———~42m-2. (m = 1,2, ...), or successively, this rela- 
22m(v +m) 
ao 


E a? _ ao 
(y+ 1)’ m 22.204) 242'v+ 1)(v +2)’ 


tion gives a2 = or gener- 


ally 
(—1)"ao 
22mm!(v + 1)(v + 2)...(v +m) 


42m = 


(m=1,2,...). (2.162) 


Bessel function of the first kind 
1 
(i) If v = n = an integer and choosing ag = Fini (so that n!(n + 1)...(n + m) = 
n! 
(n + m)!, Eq. (2.162) becomes 
(=1)” 


= m+n m! +m)! (m= 1,2,...). (2.163) 


a2m 


A solution to Eq. (2.156), using Eq. (2.158) and Eq. (2.163), is the Bessel function of 
the first kind of order n: 


[00] 


7 (Six 
In(x) =x > mtam! (n + m)! (2.164) 


m=0 


(ii) If v Æ an integer, the gamma function arises, which is defined by the integral 


lo) 
rv) =| et’! dt (2.165) 
0 


and has the property 


Tw +1)= vrv). (2.166) 


Also, the gamma function has the property that T (5) = yx and T (1) = 1. The 
gamma function generalizes the factorial function, where by Eq. (2.166) 


Pintl=n! (n=0,1,...). (2.167) 


Hence, the Bessel function of the first kind (of order v Æ an integer) is 


[0,0] 


no =x” > ("x (2.168) 
ea 2m+VmIT(v +m +1) , 


m=0 


ee O 
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The general solution of Eq. (2.156) for v Æ an integer and for all x Æ 0 is 


y(x) =c hx) + c2J_p (x). (2.169) 


However, for integer v =n, 
J_n(x) = (CDJ (x) (n=1,2,...), (2.170) 


and therefore J_,(x) and Jy (x) are not linearly independent and Eq. (2.169) (with 
v = n) is no longer a general solution. Therefore, a second linearly independent so- 
lution is required. 


Bessel function of the second kind (or Neumann’s function) 
A standard second solution Y, (x) is defined by 


Y (x) = [J (x)cosvr — J_y(x)], v¥0,1,2.... 


sın VIT 


Also Y, (x) = lim Y, (x). For the case n = 0, 1, 2..., the following series expansion is 
von 


obtained: 


(-1)""! (hm + Amtn) 2m 
22m+nm!(m +n)! 


2 x af 
a(x) = — A(x) (In5+¥)+— Do 


ine, (2.171) 
x" (n-m-1)! 5, 
T 22m—-n m! : 
m=0 
“1 
where y is Euler’s constant = 0.57721 and h, = > k with ho = 0. 
k=l 
Summary 
The complete general solution of Eq. (2.156) is 
y(x) =c1 J (x) + c2J_y (x), v Æ0 and not a positive integer, 
(2.172) 


y(x) = cy Jn (x) + c2Y (x), v=n =Q or a positive integer. 


Modified Bessel equation 
An alternative form of Eq. (2.156) is the differential equation 


x7y" + xy! — (x7 + vy =0. (2.173) 


a 
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This equation is transformed into the form of Eq. (2.156) by substituting x = iz. The 
complete solutions of Eq. (2.173) are 


y(x) = cı h(x) + c21- (x) for all v except v = 0 or positive integer, 
(2.174) 


y(x) = cı ln(x) + c2Kn (x) for n = 0 or positive integer, 


where J (x) = i™” J, (ix) (modified Bessel function of the first kind of order v) and 
T 

Kn(x) = x [Jn (ix) + iY, (ix)] (modified Bessel function of the second kind of 

order n). 


Generalized form of Bessel’s equation 
Consider 


x7y" + x(a + 2bx")y' + [c+ dx — b(1—a—r)x" +b?x*"]y =0.| (2.175) 


The generalized solution of Eq. (2.175) is 
r y |d ~ |d 
y(x) = y(174)/2 o7 bx" /r) [ez (L) +Z (4) (2.176) 
s s 


l-a 
2 
denotes a Bessel function as follows: 


2 
1 . 
where v = — ( ) — c, noting that v > 0 and a and s are constants. Here Z, 
sS 


S d l 
a Heh, | for vd real, v Æ 0, v Æ integer, 
a =J= Ss 
_ d 
b) Z=, for vd real, v = 0 or integer only, 
Z_y =Y, S 
_ d 
0) Z=h, for va imaginary, v £0, v Æ integer, 
Z= RY 
2 d 
d Zy=In, for — imaginary, v = 0 or integer only. 
Z_,»=K, S 


All of the Bessel functions and modified Bessel functions are tabulated. Fig. 2.4 
shows a plot of ordinary and modified Bessel functions of zero order. 


Recurrence formulas for Bessel functions 
The recurrence formulas are also valid for Y, (x): 


2v 
1. Jy41(x) = TW = Jy—1(%)3 


ee = 
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To + 


Bessel Functions of Zero Order 
i c 


# To-œ 
-2 
0 1 2 3 4 
FIGURE 2.4 


Ordinary and modified Bessel functions of zero order. 


1 
2. I(x) = z [Jy-1(%) — h); 
d 
3. — [x? Jy (x)] =x? J- x); 
dx 


d 
4. ax [x A@)] = =x" J41 (x). 


Functions related to Bessel functions 
Hankel functions of the first and second kind are defined by 


H® = J,(x) + iY), 
H® = J,(x) — iY, (x). 


Example 2.5.6. As shown in Fig. 2.5, a cylindrical wall surrounding a heated rod 
radially conducts heat. The heat escapes at the end of the wall into the air of tem- 
perature T4. The rod has a radius R with a surface temperature Tr. The wall has a 
length Z. 


Solution. 
(a) Heat balance 
For the volume element dV = 27x rldr , the heat balance is 
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Surface view 


Heat transfer coefficient, h 
Thermal conductivity, k 


TA 


Ta 


FIGURE 2.5 


Cylindrical wall surrounding a heated rod. 


dT 
Input = conduction (Fourier equation) = —k(2z7rl) T 


Output = conduction + loss to air 


dT dT 
=— [kerr +d (12m) | +h(2xrrdr)[T — T4], 
dr dr 


where (27rdr) is the surface area of the volume element in the last term. 
The rate of accumulation = Input — Output = 0 at steady state, so that 


dT dT dT 
—k(QQarl) — + k(Qa7rl) — +d | k2nrl— ) —h(Qzxrdr)[T — Ts] =0. 
dr dr dr 


Dividing by dr vield y2 dT hr(T — Ta) dT aT hr op Ta) 
ividin r yields kl— | r— ) =hr(T — orr = ; 

ii dr \ dr AET dr k ‘ 
; er. Lar h 
This latter equation can be rewritten as + B(T — T4) = 0, where 6 = —. 
dr? rdr kl 

Letting y = T — T4 and multiplying through by r°? gives 
r P20 (2.177) 
dr? dr f l 


Comparing with Eq. (2.175), a + 2br” = 1, so that a = 1 and b = 0. Also c + dr” — 
b(1 — d — F)r” +b?" = —r?° 6, which implies c = 0, s = 1, and d = —£. 


re O 
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From 


1 jf1-a\? 1-1\7 
v=- —c=1/(———]) —0=0, 
= 2 2 condition “d” applies. 


d 
[va = /-— 6 (imaginary), 

s 
Hence, Z, = I, = Io and Z_, = K, = Ko. Therefore, from Eq. (2.176), the general 
solution (where x = r) y(r) = c1 lo(r s/p) + c2Ko(r./B). The constants are obtained 
from the following boundary conditions: 


(i) T(r — œ) = T; (that is, a finite air temperature); 
(ii) T(r = R) =Tr. 


Using the first boundary condition, since Jo(r) —> oo as r —> œ (see Fig. 2.4), 
this implies cı = 0. Thus, y = c2Ko(r./B) or T — Ta = c2Ko(r/B). The second 
Tr— TA 


boundary condition gives TR — T4 = c2 Ko(R JVB) so that c2 = ————. 
Ko(R VP) 


T(r)— Ta E Ko(r VB) 
Tr — T4 Ko(R VB) | 


The final solution is therefore 


2.5.4 Sturm-Liouville problems 


The Bessel and Legendre equations, as well as other engineering equations, can be 
put in the general form of a Sturm—Liouville equation. For example, the Bessel equa- 


d? d d 
tion <2 +32 +82 —n2)y(%) =0, letting ¥ = kx (which implies = ~ and 
s dx? dx dx k 
d y if 


qe a reduces to x?y” + xy’ + (kx? — n?)y =0, or dividing by x, 
2 
A n 2 
[xy] + (-S +s) y=0. where À = k*. 
í 
Similarly, Legendre’s equation (see Section 2.5.1), 


(1 —x*)y"” —2xy'+n(n + Dy =0, 


can be written as [(1 — xy Y + ày = 0. Both equations are of the form of a Sturm— 
Liouville equation, 


roy T +a) + Ap@)ly = 0. (2.178) 


This Eq. (2.178) is considered on some interval, a < x < b, where p, q,r, and r’ are 
continuous and p(x) > 0. The boundary conditions at the endpoints are 


ki y(a) + k2y'(a) =0, (2.179a) 


OOOO. 
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ly(b) + hy'(b) =0, (2.179b) 


where kı and k2 are not both zero and /; and h are not both zero. The solution of the 
Sturm-Liouville problem (that is, Eq. (2.178) and Eq. (2.179)) is the eigenfunction 
y(x) for the associated eigenvalue À. 


Example 2.5.7. Find the eigenvalues and eigenfunctions of the Sturm—Liouville 
problem (r = p = 1 and q = 0) y” + Ay = 0 with y(0) = 0 and y(r) =0. 


Solution. For A = v?, y(x) = Acos vx + B sin vx. Using the boundary conditions, 
y(0) = A = 0 and y(x) = B sin vx = 0. The second boundary condition implies v = 
0, +1, +2.... However, for the case v = 0, one obtains the trivial solution y = 0. 
Therefore, the eigenvalues are A = v2, where v = 1,2,..., with the corresponding 
eigenfunctions y(x) = sin vx. [answer] 


Definitions 


(i) The functions yı and y2 are orthogonal on a < x < b with respect to a weight 
D(x) > Oif 


b 
1 P(X) ¥m(X)yn(x)dx =0 form én. (2.180) 


a 


(ii) The norm || ym || of Ym is 


b 
AE i f poroi (2.181) 


(iii) Functions are orthonormal if (i) applies and ||ym||=1. 


Example 2.5.8. The functions ym(x) = sinmx, m = 1, 2..., form an orthonormal set 
on the interval —z < x < x, that is, 


T 1 T 1 us 
f sinmssinnxa = 5 f cos(m—mdx— 5 | cos(m+n)dx=0 (mn). 


=H =H = 


Also ||Ymll = 4/ fZ; sin? mx dx = yT (m = 1,2,...). 


. sinx sin2x sin3x 
Therefore the orthonormal set is ... [answer] 


Orthogonality of eigenfunctions 
The eigenfunctions ym(x) and yn (x) of the Sturm—Liouville problem in Eq. (2.178) 


and Eq. (2.179) that correspond to the different eigenvalues àm and A,, are orthogonal 
on the interval a < x < b with respect to the weight function p. 


Note that: 
if r (a) = 0, then Eq. (2.179a) no longer applies; 


ee = 
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if r(b) = 0, then Eq. (2.179b) no longer applies; 
if r(a) = r(b), then Eq. (2.179) is replaced by the periodic boundary conditions 
y(a) = y(b) and y'(a) = y'(b). 


Example 2.5.9. Find an orthonormal set of eigenfunctions for the periodic Sturm- 
Liouville problem y” + Ay = 0 with y(x) = y(—z) and y(x) = y'(—z). 


Solution. From Example 2.5.7, the general solution is y = A cos kx + B sinkx (k= 
V1). Use the boundary conditions 


Acos(kz) + Bsin(kz) = Acos(—kz) + Bsin(—kz), 
— kA sin(kr) + kB cos(kz) = —kA sin(—kz) + kB cos(—kz). 


Since cos(—a@) = cos(@) and sin(—a) = — sin(@), the above equations give sin kr = 
0,A =k? =n? =0, 1, 4, 9.... The eigenfunctions are 


1, cosx, sin x, cos2x, sin 2x, .... 


These eigenfunctions are orthogonal, where 


T 1 T 
1 cos mx sinmxdx = >| sin2mx dx =0 


= =H 


as per Eq. (2.180). In addition, using Eq. (2.181), 
T 
1|| = f dx =V2n, 
-T 


T 
cosmx|| = / cos? mx dx =./, 
=: 


T 
sinmx|| = f sin? mx dx = /T. 
= 


. 1 cosx sinx cos2x sin2x 
The orthonormal set is [answer] 


Vin NTE? JTE Ja? Ja” 


Orthogonality of Bessel functions 
Recall the Sturm—Liouville equation for Bessel’s equation 


ri 
[x Jj (kx) + (-= + ex) In (kx) = 0. 


2 
Here p(x) =x, q(x) = a r(x) =x, and A =k’. Since r (0) = 0, the orthogonality 
theorem implies an orthogonality on the interval 0 < x < R. Thus from Eq. (2.179b), 


EEE 
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Jn (kR) =0 (n fixed), where KR = &mn and J;,(&mn) has infinitely many zeros (m = 
1,2,...). Thus, k = kmn = &mn/R. For each fixed nonnegative integer n, the Bessel 
functions form an orthogonal set J, (kinx), Jn(kanx), ... on the interval 0 < x < R 
with weight p(x) = x, such that 


R 
1 xJn(kmn¥)Jn(kjnx)dx=0| for jm. (2.182) 
0 


2.5.5 Eigenfunction expansions 


Eigenfunctions arising from the Sturm—Liouville problem can be used to develop 
series of given functions (for example, Fourier series), and provide the necessary 
techniques for the solution of engineering problems in heat conduction, fluid flow, 
etc. 


Notation 
For an orthonormal set with respect to p(x) > Oona<x <b, 


b 
Ym, Yn) = f P(X) ¥m(%) yn (x)dx = Sinn, 


where mn is the Kronecker delta function defined as 


1 if m=n. 


For the norm, || ¥m || = V (Ym, Ym) = y J? p(x) y2 (x) dx. 


Now if yo, y1,... are an orthogonal set with respect to p(x) on the interval a < 
x <b, then a given function can be represented by the orthogonal expansion or 
generalized Fourier series 


0 if msn, 
Ômn = 


[0,6] 


F= $ am Ym (X) = aoyo(x) + ayi) + (2.183) 


m=0 


Moreover, if ym are eigenfunctions of the Sturm—Liouville problem, Eq. (2.183) is 
called an eigenfunction expansion. 

As a result of orthogonality, the Fourier constants ao, a1, ... can be easily determined, 
where by multiplying Eq. (2.183) by p(x) yn (x) (n fixed) and integrating we find 


b b oo lee) 
(f, m= | pfndx= f (5 amn) Yn dx = XO am Om, Yn). 


m=0 m=0 


se O 
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Because of the orthogonality, the integrals (Ym, yn) on the right-hand side are zero 
except when m =n, where (yn, Yn) = ||Yn \|. Therefore, (f, Yn) = an || Yn |2. The co- 
efficients of Eq. (2.183) are given by 


s Jm 1 i 
p=% 5 = D(x) f )ym(x)dx | (m=0,1,2...). (2.184) 


2.5.6 Fourier series 
The Sturm—Liouville problem in Example 2.5.9 gave the orthogonal set 


1, cos x, sin x, cos 2x, sin2x,... 


on the interval —z < x < x with p(x) = 1. The corresponding eigenfunction expan- 
sion can be written as 


[0,6] 
f(x) =a + > (dm cosmx + bm sinmx). (2.185) 


m=1 


Eq. (2.185) is called a Fourier series of f(x) and the coefficients in Eq. (2.184) are 
the Fourier coefficients of f(x). From Eq. (2.184) and the norms derived in Exam- 
ple 2.5.9, the Fourier coefficients are given by the so-called Euler formulas: 


T 
Ba Peay. (2.186a) 
2m Jn 
1 T 
am = z f(x)cosmx dx (m= 1, 2,...), (2.186b) 
= 
1 T 
bm = z f(x) sinmx dx (m = 1,2,...). (2.186c) 


=i 


Example 2.5.10. Find the Fourier series of the periodic function in Fig. 2.6 (where 


f(x + 20) = f(x)): 


—x if -m<x <Q, 


rw=] 
x 


if O<x <z. 


Solution. Using Eq. (2.186), the Fourier coefficients can be evaluated as 


1 0 a 7T 


1 0 x 
an = = | (—x) cos mx dx + 1 (x)cosmx dx 
0 


= 


OOOO. 
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f(x) 


FIGURE 2.6 


Periodic function f (x). 


1 | cosmx  xsinmx |° 1 [cosmx  xsinmx |” 
= z e e 
T m m Me 
2 2 m odd, 
= — [-1 + cosmr] = — 5[- 1+(- 1 |= 
mm mm? 
m even, 
1 
bm = — Mes s)sinme dx + f (x)sinmsd | |= 
T LJ—x 


where (m = 1, 2, 3, ...). The Fourier series is 
xz 4 1 1 
f@= oe cosx + zz COS 3x + zz COS Se + oe Oe [answer] 


Example 2.5.11. (Fourier—Legendre series) As shown, the Legendre equation is a 
Sturm—Liouville equation: 


[0 —x*)y'! +Ay =0, 
with A =n(n+1),r=1-x?, q =0, and p = 1. Here r(—1) = r(1) = 0 is a periodic 
boundary condition for the Sturm—Liouville problem on the interval —1 < x < 1. For 


n = 0, 1,2... (that is, à =0-1,1-2,2-3,...), the Legendre polynomials P,(x) are 
the eigenfunction solutions. Also, by the orthogonality theorem, 


1 
f PREN =0 
= 


and the norm is 


1 
| 2 
I Proll = | Pawax= m41 (m =0, 1, 2,...). 


se O 
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The eigenfunction expansion for a Fourier—Legendre series is 


f(x) ) P, (x) P P 2 ' 
x)= a x)=a +a +... =a0+aıx+a =e = +..., 
i mim oro 1f 1 0 1 2 2 7 


m=0 


where using the norm in Eq. (2.184), the coefficients are 


2m+ 1 
2 


1 
ia 1 F(x) P(x) dx. 
-1 
Example 2.5.12. (Fourier—Bessel series) From Section 2.5.4, the Bessel functions 
form an orthogonal set, Jn (kinx), Jn(Konx), Jn (k3nX), ..., where n is fixed and kmn = 
Qmn/R on the interval 0 < x < R. From Eq. (2.183), the corresponding Fourier— 
Bessel series is 


CO 


F = Yo ain In KmnX) = a1 In (Kin) + az Jn (Kan) + ~~.. 


m=0 


The norm is given by 


R 2 
R 
|l Jn Kinnx) || = If xJ2(kmnx) dx = > Tny (kmn R). 
0 


The coefficients evaluated by Eq. (2.184) are 


2 f 
an = —— s xf (x)Jn(kmnx) dx. 
ý RZJ? (Ohne) 0 Gi 


Fourier series for functions of any period 
A periodic function f(x) is said to have a period p if for all x, 


f+ p) = f(x), 


where p is a positive constant. Eq. (2.185) and Eq. (2.186) are applicable to functions 
with a period p = 2m (e.g., see Example 2.5.10). If a function f(x) has a period 
p = 2L, then the Fourier series corresponding to f(x) is 


CO 
F(x) =a+ > (an cos nx + Bp sin =x) (2.187) 


n=1 


where the Fourier coefficients are 


1 L 
ao = zl. f(x)dx, (2.188a) 


OOOO. 
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1 f} ni 

a, = — f (x) cos —x dx (n= 1,2,...), (2.188b) 
LJ L 
1 fŁ nm 

by, = = f (x) sin —x dx (n= 1,2,...). (2.188c) 
LJ L 


Example 2.5.13. Find the Fourier series of the periodic function in Fig. 2.7 with the 
period p = 2L = 4 (that is, L = 2), where 


t if —-2<x <0, 
f@=, 2 
ae if O<x <2. 


f(x) 


FIGURE 2.7 
Function with a period p = 4. 


Solution. Using Eq. (2.188), the Fourier coefficients can be evaluated as 


If food tf (i+3)a if ~)d : 

ao = — x)dx = — x x=, 

aoa it A} 4 2 4 Jo 2 2 
0 


1 f? x nex 
+ (1 ) cos dx 
2 Jo 2 2 


| 
oo 
3 
Paes 
— 
N 


EEE 
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Therefore, the Fourier series is given by 


[answer] 


TX 1 30x 1 5x a, 
2 32 2 52 2 i 


1 4 
f@m==+ cos + -z cos +—cos 
2 m? 


Even and odd functions 

A function is called even if f(—x) = f(x) (for example, cos x). Similarly, a func- 
tion is called odd if f(—x) = — f(x) (for example, sinx). The function in Exam- 
ple 2.5.13 was even and had only cosine terms in its Fourier series. This result, in 
fact, is general where the Fourier series of an even function of period 2L is called a 
“Fourier cosine series,” that is, 


nit 


f(x) =ao + Yan cos -7 (f even), (2.189) 


n=1 


with coefficients 


L.f* 2.7% nr 
ao = > f(x)dx, an = = f(x) cos —x dx (n=1,2,3,...). (2.190) 
L Jo L Jo L 


The Fourier series of an odd function of period 2L is a “Fourier sine series,” that 
is, 


f(x) = Y basin ax (f odd), (2.191) 


n=1 


with the coefficients 
b =f fosna ed (2.192) 
== x) sin —x dx. 1 
” L Jo L 


Half-range Fourier sine or cosine series 

A half-range Fourier sine or cosine series is a series in which only sine or cosine 
terms are present, respectively. When a half-range series corresponding to a given 
function is desired, the function is generally defined in the interval (0, L) (which is 
half of the interval (—L, L), hence the name half-range). The function then specified 
as odd or even is clearly defined in the other half of the interval (—L, 0). As such, the 
half-range cosine series is given by Eq. (2.189) and Eq. (2.190) (for the even periodic 
extension), and the half-range sine series is given by Eq. (2.191) and Eq. (2.192) (for 
the odd periodic extension). 


Example 2.5.14. (Triangle and its half-range expansions for the function in 
Fig. 2.8) 


M OUO 
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f(x) 


FIGURE 2.8 


Half-range triangular wave. 


Find the half-range expansions of the function 


2 
—x if 0<x<L/2, 
fx)= 


zE» if L/2<x < L. 


Solution. 


(a) Even periodic extension 


From Eq. (2.190), 


A a 27" 1 
=e Wa d z= L— d = _, 
ao GS x seah x) -| 5 


272 rt? nx I fh ntx 
an= >| — x cos —— dx + — (L — x) cos — dx}. 
0 L L L/2 L 


Integrating the last expression by parts gives an = 


16 16 
Dagd? 0 ~ GPqg?? M0 = TR?” 


ni 
35 (2cos — — CoOsSnT — 1). 
nini 2 


Therefore, az = — 
2,6, 10, 14,.... 


. and an = 0 if n £ 


Hence, the first half-range expansion using Eq. (2.189) is 


rol (1 ggg 2 4b oos i ' 
= Cos COS water E answer 
A S ea ee J x 


ee = 
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This series represents the even periodic extension of the given function f (x) of period 
2L (see Fig. 2.9(a)). 


(b) Odd periodic extension 


8 
Similarly from Eq. (2.192), by = ae) sin — 
ner 


Hence, the second half-range extension is 


f(x) = 


2 sin sin + sin 


8 1 TX 1 . 3mx 1 . 5x 
7 [answer] 
L 32 L 52 L 


m2 


This series represents the odd periodic extension of f(x) of period 2L (see 


Fig. 2.9(b)). 
(a) (b) 
a 
-L 0 L x 
FIGURE 2.9 


Periodic extensions of f(x) in Example 2.5.14 for an (a) even and (b) odd extension. 


Dirichlet conditions 
Given are the following conditions: 


(i) f(x) is defined and single-valued except possibly at a finite number of points in 
(=L, L); 
(ii) f(x) is periodic outside (—L, L), with period 2L; 
(iii) f(x) and f'(x) are piecewise continuous in (—L, L). 


Then the series in Eq. (2.187) with coefficients in Eq. (2.188) converges to 
(i) f(x) if x is a point of continuity; 

a f(x +0)+ f(x — 0) 
(ii) 5 


In this theorem, f(x +0) and f(x — 0) are the right- and left-hand limits of f(x) and 
represent lim f(x + €) and lim. f(x — €), respectively, where € > 0. The conditions 
E> E> 


if x is a point of discontinuity. 


in (i) to (iii) imposed on f(x) are sufficient but not necessary, and are generally 
satisfied in practice. 


OOOO. 
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Parseval’s identity 
Parseval’s identity states that 


1 es = 
al {f@) dx =ap+ >> (a; + bi) 
-L n=1 
if ag, an, and b, are the Fourier coefficients corresponding to f(x) and if f(x) satis- 


fies the Dirichlet conditions. 


Complex notation for Fourier series 
Using Euler identities, 


et? = cos +isinð, 
T (2.193) 
e`" = cos — i sin0, 
where i = y — 1, the Fourier series for f(x) can be written as 
[0,6] 
f=) cepel, (2.194) 
n=—00 
where 
1 f} PEES 
c= oF J. fœ) "T/L dx. (2.195) 


Eq. (2.194) supposes that the Dirichlet conditions are satisfied and that f (x) is con- 
tinuous at x. If f (x) is discontinuous at x, the left-hand side of Eq. (2.194) should be 
replaced by [f(x +0) + f(x — 0)]/2. 


Example 2.5.15. Find the complex Fourier series of f(x) = e~”* over the range 
—m <x <a with the periodicity f(x + 27) = f(x). 


Solution. Using Eq. (2.195) with Maple in Appendix A, 


Qs 2 á 
T IT INT =FR =n 
1 EETA _ 1 fee" eTe 
Cn e e dx = ; 
27 


~ on J ( + in) 
ee (x — in) : Li , 
Multiplication by ————— and noting that e'”™ = e™'”™ = (—1)” gives 
p y Gin) g g 


1 m—in 2 2 sinha? / x —in 
= "| : p T |= 1)”. 
n= On (45) ae í x (5N ) 


oo OO O 
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Hence, the complex Fourier series using Eq. (2.194) is 
ax sinh r? 5 p” m—in inx ( ) i 
e = e -T <x <7). answer 
x n? +n? 


n=—Oo 


Note that from the above complex series, one can obtain the usual Fourier series as 
follows. Using the Euler identities in Eq. (2.193) (with 0 = nx), the term (a —in)e'”* 
in the series is 
(a — in)(cosnx +i sinnx) = m (cosnx +nsinnx) — i(ncosnx — m sinnx) 
(n positive), 
(a + in)(cosnx — isinnx) = m (cosnx +nsinnx) + i(ncosnx — m sinnx) 
(n negative), 
m (n=O). 


Hence, the imaginary terms cancel out in the series, and the real Fourier series is 


e Ts 


senai 1 
= 2sinhz~ | = — ———~ (cosx + sinx) + 


2 m+ (cos + sin 2x) = +... 


1 
m? +22 
where —7 <x <7. 


In summary, Fourier series are series of cosine and sine terms and arise in the 
important task of representing general periodic functions. They constitute a very 
important tool in solving problems that involve ordinary and partial differential equa- 
tions (see Chapter 5). 


Problems 
2.1 Consider the nonhomogeneous system of first-order, linear differential equa- 
tions y| = y2 + cosht and y, = yı with boundary conditions y;(0) = 0 and y2(0) = 
— 5. Solve this initial value problem by the following three methods: 

(a) Matrix methods. Use the method of variation of parameters to determine the 
particular solution. 


(b) Laplace transform methods. 


(c) Convert the two first-order differential equations into a single second-order 
differential equation and solve this latter equation. 


2.2 Consider the following problems: 


(a) State the differential equation for the Sturm—Liouville problem where r(x) = 
x, p(x) = x! and q(x) =0. 


ee nn 
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(b) Using the transformation x = e’, show that this differential equation reduces 
d? y 
to —— +ìy =0. 
dt? y 
(c) Given the boundary conditions y(1) = 0 and y(e) = 0 for the differential equa- 


tion in (a), what are the corresponding eigenvalues and eigenfunctions? 


a 
2.3 Using the transformation x = cosht, show that the differential equation = + 
d 
coth t — 20y = 0 reduces to Legendre’s differential equation and give the expres- 


sion for the Legendre polynomial which is a solution to this transformed equation. 


2.4 Consider the hypergeometric equation x(1 — x)y” + [b — (2 + b)x]y’ — by = 0, 
where b is a constant larger than unity. Using a Frobenius method, show that the series 
solution about x = 0 is the geometric series such that y1 (x) = 1 +x +x? +... 


2.5 Find a general solution, in terms of Bessel functions, for the differential equa- 
tion 4x?y” + 4xy’ + (x — n?)y = 0, where n is an integer, by using either the 
substitution or the generalized form for the solution of the Bessel equation. What 
is the solution that satisfies the condition |y(0)| < oo? 


2.6 Using J, for a series expansion, show that the coefficients of the Fourier—Bessel 
series for f(x) = 1 over the interval 0 < x < 1 are given as am = 2/ {Q@moJ1 (@mo)}, 
where &œmo are the zeros of Jo. 

i : 0 if —1<x<0O, 
2.7 Consider the function f(x) = | 2 if0<x<1. 
Calculate the first two terms of the Fourier—-Legendre series of f(x) over the interval 


—1 <x < 1. What is the value of the complete Fourier—Legendre series of f(x) at 
x =0? 


2.8 Consider the differential equation 16x? y” + 3y =0. 


(a) Calculate the two basis functions using a Frobenius method and give the gen- 
eral solution. 


(b) Derive the second basis function from y;(x) using a method of reduction of 
order. 


2.9 Give the Legendre polynomial P, (x) which is a solution to the differential equa- 
1 

tion = (1 — x?) y” — xy +3y =0. 

10n 5 ( x ) y xy +3y 


2.10 Find a general solution, in terms of Bessel functions, for the differential equa- 


„y dy 


tion x 72 +x ae (1 + 4x*) y = 0 by (i) using the transformation z = x? and (ii) 
x x 


employing the inspection method for the generalized form of Bessel’s equation. 


2.11 The steady-state temperature distribution for a fin of cross-sectional area A, 
constant perimeter P, constant conductivity k, and length L can be determined from 


ao OO O 
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the following differential equation: 


g ie hP(T —Ty)=0 
dx dx Ra e 


where A is the heat transfer coefficient for the fin surrounded by a fluid with a constant 
temperature To. 

(a) Is this differential equation homogeneous or nonhomogeneous? With a change 
of variable 0 = T — Tæ, find the general solution for the variable 6(x) by putting 
m? =hP/(kA). 

(b) Find a solution to the boundary value problem for the boundary conditions 
0(0) = o and 6’(L) = 0. 


2.12 Given the periodic function f(x) =x for —2 <x <2 and f(x +4 = f(x). 
(a) Sketch this function over the interval —6 < x < 6. 
(b) Find the Fourier series of this function. 


(c) What is the value of f(x) that one would compute from the Fourier series 
solution at the discontinuity of x = 2. 


2.13 Given is the differential equation xy” + (1 —x)y’ + ny =0. 

(a) Determine one of the basis solutions for this differential equation, when n = 2, 
using a Frobenius method. 

(b) The Laguerre polynomial L,,(x) is in fact a solution of this differential equa- 


Mam 


(x e” ). Using this formula, 


: f . _d 
tion as given by the Rodrigues formula L, (x) = e* Tan 


specify the Laguerre polynomial L(x). What is the value of the arbitrary constant in 


part (a) to obtain the Laguerre polynomial solution L(x)? 
2 


2.14 Consider the differential equation S — xy = 0. Using the transformations 
x 


2 
y =ur/x and ~ix*/* = z, where i = /—1, show that this differential equation re- 


duces to the Bessel differential equation 


What is the solution for u(z) and y(x)? 


2.15 Determine the particular solution for the nonhomogeneous, second-order, or- 
dinary differential equation y” + y’ — 2y = 4 sin 2x using the following methods: 


(a) undetermined coefficients, 
(b) variation of parameters, 
(c) Fourier transforms (see Problem 3.10). 


CHAPTER 


Laplace and Fourier 
transforms 


The Fourier transform and the Laplace transform are related quantities involving 
integral transformations to simplify the solution of problems involving differential 
equations. Both methods are used for solving differential and integral problems. 
Fourier transforms can be used for signal design; Laplace transforms for control the- 
ory. Both methods arise in engineering to simplify calculations in system modeling 
or, under certain circumstances, converting partial differential equations into ordinary 
differential equations. The properties of these transforms are discussed. 


3.1 Laplace transform methods 
3.1.1 Definition of a Laplace transform 


The Laplace transform of a function f(t) is defined as 


t 
LIOI=Fo= | e" ftdt (3.1) 


and exists according to whether the integral in Eq. (3.1) exists (converges). 

In practice, there will be a real number sọ such that Eq. (3.1) exists for s > So, 
and does not exist for s < so. The set of values s > sọ for which Eq. (3.1) exists is 
called the range of convergence or existence of /{ f(t)}. The original function f (t) 
is called the inverse Laplace transform of F (s) and is denoted by 


fO=Ff MFO). (3.2) 


The symbol -Z in Eq. (3.1) is called the Laplace transform operator, and is a linear 
operator (as well as & —1), that is, 


Ala fiO+apnO}=aL{AO} + ea RO. (3.3) 


Example 3.1.1. Find the Laplace transform of f(t) = 1. 
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Solution. We have 


lo) 1 T 
ziroy= f e™™dt= lim -se""| 
0 


T>0 S 0 


1 1 1 
= lim [-že=7 + Le =- (s>0). [answer] 
s s 


T— œ S 


The Laplace transforms of other elementary functions are listed in Table 3.1. 


Table 3.1 Laplace transforms of some elementary functions. 


f(t) ZIS) = F(s) f(t) Zf (t) = F(s) 

1 sS 
1 -, s>0 t =, s>0 

s> cos w arg § > 

! 

t n=1,2,34. a s>0 sin wt < z s>0 

s s? +w 

Tr 1 i 
tf, a>-l -a ) s>0 coshat T s > |a| 
at 1 7 h a 
e a s>a sinhat ogi’ s > jal] 
t”! 1 s— 
rw =z n> 0 e“ cos bt aoe s>a 

n s s—a)} + 
pr! eat 1 b 
o!=1 , n=l,2,3... e“ sinbt | ———.——~,_ s>a 
(n—1)! (s—a)" (s — a)? +b? 
! = 

te coe s>a,n=0, 1,2... | e% coshbt oe s—a> |b| 

s—a s—a)*— 
177! eat 1 b 

, n>O e“ sinhbt | ———.—~, _ s—a>|b| 
T(n) (s — a)" (s — a)? — b2 


Existence of Laplace transforms 


If f (t) is piecewise continuous on every finite interval in the range ¢ > 0 and satisfies 


ISOL < Me” (3.4) 


for some constants y and M, then Z {f (t)} exists for s > y. 


Uniqueness 

If the Laplace transform of a given function exists, it is uniquely determined. Con- 
versely, if two functions have the same transform, these functions cannot differ over 
an interval of positive length, although they may differ at various isolated points, that 
is, the inverse of a given transform is essentially unique. 


A 
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3.1.2 Laplace transforms of derivatives 


Laplace transforms are useful for solving linear differential equations, and therefore 
it is useful to find Laplace transforms of derivatives. 


Laplace transforms of first derivatives of f (t) 

Let f(t) be continuous, have a piecewise continuous derivative f’(t) on every in- 
terval in the range t > 0, and satisfy Eq. (3.4). Then #{ f’(t)} exists when s > y 
and 


Lf O}=sL{f(O}—- fO. (3.5) 


Laplace transforms of f(t) 


The above theorem can be generalized if f”) (t) is continuous, f™ (t) is piecewise 
continuous on every finite interval, and f(t), f’(t),..., f ™—l)(¢) satisfy Eq. (3.4): 


LIEPA =s"L(f@}—s™ | FO) —s" 7 f'0) —...-— Ff PO).| 86) 


Example 3.1.2. Let f(t) = t? and find the Laplace transform of Z {t°}. 


2 
Solution. Since f(0) = 0, f’(0) = 0, f”(t) = 2, and #{2} = 2Y{1} = — (see 
S 
Table 3.1), from Eq. (3.6), Z{f’(t)} = Lf} — sf (0) — f'(0). Hence, 
Lf" ()} = L{2} = — = S Zf —s- 0-0. Therefore, {t} = 2 (which 
S 


is in agreement with Table 3.1). [answer] 


“IN 


Example 3.1.3. Solve y” — y = t with y(0) = 1 and y’(0) = 1. 


Solution. From Eq. (3.6) and Table 3.1, the subsidiary equation is s?Y (s) — s y(0) 
wm 


=! 


1 1 
y (0) —Y(s) = — so that (s? — 1)¥(s)=s+1+-—. 
anaes, s2 s2 


=! 
s+1 1 1 


s2—1 22h s-—!l 


Hence, on solving the subsidiary equation, Y (s) = + 


1 1 
s2-1 s?) 


1 
The problem solution is therefore y(t) = YZ !{(Y(s)} = Se eer + 
Ta 


1 1 
27l }— Z7l{—}=e' +sinht—t. [answer] 
s2? —1 s? 
The general methodology for solving this problem with a Laplace transform method 


is shown below: 


E OOOO 
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General methodology 


t-space s-space 
1. Given problem = 
” a 2. Subsidiary problem 
y= y=t _ i 
y0) =1 (s? -D)¥=st1+5 
y@=1 ° 
Á. Problemi solution 3. Subsidiary solution 
wadoh m ie ee 
2 ~s—-1 s-—-1 32 


Example 3.1.4. Simultaneous first-order rate equations. 


Consider the set of first-order rate equations describing the consecutive reactions 


k k i 
A = B, B Ż C. The rate equations are 


dna 
dt 


dnp 
= —king and — = kina = kanp 

dt 
and the material balance between the initial and final products is given by 
Nao + nbo nco = Na +np + ne. 


Solution. 
(a) Direct integration 


The solution of the first rate equation is directly obtained by separation of vari- 


td 
ables (see Section 2.1): I A -f kı dt. Thus, on integrating, 
0 Na 0 
nalt) 
ln[na(t)] — Infng(0)] = —kıt or In} —— |= —kıt. 
y Nao 
Nao 


Hence, the solution is na(t) = naoe™™!! . Substitution of this result into the other rate 


equation results in a linear differential equation of first order: 


dnp _ 
— + konp = kinaot 
dt 


kıt 


D 
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Multiplying through by the integrating factor e*2’ (see Section 2.1): 


„dnp E dnpe*?! 
e"! + konpe! = kinaoe ay or oe) 


dt 


= kingoe ET, 


t t 
Integrating both sides of the equation, f (anse!) _ i kingoe ©! at yields 
0 0 


kin 
Nbo = ae e-hiz) i Hence, 
(kı — k2) 


kat 


np(t)e 


kiNao —(kı—k2)t 
np (them! — npo = [e (kiko) -1]. 
° (k= ki) 


kinao 


(ko — kı) 
Also by the mass balance, ne = nao + nbo + nco — Na — np. 


kin 
kıt —kot ao —kit —kot 
1 oe [ it _ e7% 1: 


Thus, simplifying, nj (t) = npoe "2! + [eet — e=]. 


Therefore, ne (t) = nao nbo +Nco —Naoe “!' —Nboe 
(k2 — kı) 
[answer] 
(b) Laplace transform 
The subsidiary equations become 
SNa — Nao = —ki Na, 
sNp — Nbo = kı Na = ky Np. 
Solving these two equations yields 
Nao 
Nal(s) = , 
a(s) Aas 
+kN k 
N,(s) = Nbo 1/Va Nbo 1na0 


= + 
ka +s kgo+s (kı +s)(ko +5) 


_ bo + kinao 1 1 
kg +5 (ky — ky) | (ky +5) (ka +s) ` 
From Table 3.1, 


nalt) = 2 '{Ng(s)} = nace ™", 


k ao ot an 
np(t) = L'{No(s)} = npoe ™™” + A kD [e Kit g arj, [answer] 


1 
Recall 27! | =! =e“ from Table 3.1. This result is the same as in part (a). 
sta 


Me 
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U(t-a) 


FIGURE 3.1 


Unit step function. 


3.1.3 The unit step function 


The unit step function, also called the Heaviside unit step function (see Fig. 3.1), is 
defined as 


Oif t<a, 
lif t>a. 


ue-a=| 


It is possible to express various discontinuous functions in terms of the unit step 
function. The Laplace transform of the unit step function is 


LU(t—a}= a s>0 
and 
ef] =U(t—a). 


3.1.4 Special theorems on Laplace transforms 


Some important results involving Laplace transforms (and corresponding inverse 
Laplace transforms) are detailed below. In all cases, it is assumed that f(t) satisfies 
the existence theorem. 


First shifting theorem (s-shifting) 
We have 


Le" f (t)} = F(s—a) 
and 


L'{F(s—a)} =e" f). 


D 


3.1 Laplace transform methods 81 


f(t) 


FIGURE 3.2 


Example of a second shifting function. 


For example, by applying the theorem and using Table 3.1, 


at te S—a 
Lie cos wt} = ae ee 
Second shifting theorem (t-shifting) 

We have 


L{U(t—a) f(t —a)} =e F(s) 
and 


LSe F(s)| =U(t—a) f(t —a). 


Example 3.1.5. Find the Laplace transform of 2{f(t)}, where f(t) is given by 
Fig. 3.2 such that 


2 if O<t<qz, 
f=; 0 if w<t<2n, 
sint if t>2z7. 


Solution. In terms of step functions, f (t) = 2U (t) — 2U (t — m) + U (t — 27) sint. 


2e7Ts eT ns 


By applying the theorem and using Table 3.1, LIFO = Fa : 
sS sS s4+1 
[answer] 


OOOO. 
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Integration of a function 


We have 
t 
F 
2i] fandu) 2A 
0 S 
and 
F t 
{=o =f f(u)du. 
s 0 
Example 3.1.6. Given the Laplace transform of #{f(t)} = oe find the 


s(s? +2)’ 
function f(t). 


1 1 
Solution. From Table 3.1, 47! | E | = — sin wt . Therefore, using the theorem, 
s+ ow w 


galfi = + f sinoudu= 4 cose [answer] 
s \s* +o? odo 


Integration of a transform 


We have 
e|) = f Fondu 
and 


s Fadu) = mo 


t 
if lim LO exists. 
t> 


{t= pak 
Example 3.1.7. Find z í l. 


=$ —t 


= lim &— = 1 from VHopital’s rule, 1 — e~’ 


is con- 


i= 
Solution. Since lim 


t—>0 t—>0 
tinuous, and Eq. (3.4) applies, the above theorem can be used. By Table 3.1, 


1 1 
Fil — e`} =-— a As such, by the theorem, 
s s 


1— e~ Ai 1 , Efi 1 
L = = du= lim — du 
t y u ut+i K>œ Js u u+l1l 
; K í 1 1 
= pm ne — hu + D] p = jin(1+2)-in(i+5)] 


1 
=In (1 + 5 : [answer] 
s 


S Ea 
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Differentiation of a transform 
We have 


Ar fore yt = (—1)" F™ (s), n=1,2,3,..., 
S 


and 


LF (s)} = CDt fO). 


For instance, for n = 1, L {tf (t)} =—F'(s). 


Example 3.1.8. Find the Laplace transform of -Z {t sin Br}. 


Solution. From Table 3.1, 2I sin wt} = 


PEET Thus, setting w = £ and using the 


d 2 
above theorem for n = 1, Lit sin pt} = 7 ( 5 £ =) = (2 at . [answer] 
s\s s 


Periodic functions 
If f(t) has a period p > 0, such that f(t + p) = f(t), then 


1 P 
LIFO} = el e" f(t) dt. 


Example 3.1.9. Given the square wave function f (t) in Fig. 3.3, find KARLON 


f(t) 


FIGURE 3.3 


Example of a square wave function with a period of 2a. 


n 
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Solution. Using the above result, 


a 2a 
£10) = =a (f ke dt + ; (—=k)e™"! ar) 


as 


Ook 1-2e7% tes kâle e? 
~ s 1te-%)(1—e-%) s (14e S 


e2 


k as 
= — tanh —. [answer] 
S 2 


Convolution theorem 
We have 


t t 
2l f fga- wau] aai fa -we au = F(s)G(s) 
0 0 


and 


t t 
L"{F(s)G(s)} = f fg- u)du = [ f(t —w)g(u) du. 


The above integral, called the convolution of f and g, is written as 


t 
f*g -=f f(u)g(t —u)du. 


1 1 
(2+D2 CFD CHD 


Example 3.1.10. Given H(s) = 
2A: 


find h(t) = 


1 
Solution. From Table 3.1, ee art | = sint. 
s 


Hence, using the convolution theorem, 


t 
ht) = 27'{H(s)} =sintxsint = | sinu sin(t — u) du 
0 


1 ¢ 1 1 1 
=z -cosrdu+ 3 f cos(2u — t)du = -5t cost + z sint. [answer] 


3.1.5 Partial fraction method 

In many problems, the subsidiary equation occurs as a quotient of two polynomials: 
P (s) 

O(s) ’ 


where P(s) and Q(s) have real coefficients and no common factors. The degree of 
P(s) is lower than Q(s). 


Y(s)= 


D 
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1 
Example 3.1.11. (unrepeated factors) Find %7! ee 
s(s — 2) . +3) 


s+1 A 


F T 
s(s—2)(s+3) s (s—2) s+3) 
To determine the constants A, B, and C, multiply by s(s — 2)(s + 3) so that 


Solution. One can set 


s+1= A(s —2)(s +3) + Bs(s +3) + Cs(s — 2). 


This relation must be an identity and thus it ner hold for all values of s Then, letting 
s = 0, 2, and —3 in succession, one finds A = —¢, B = jp, and C = — f}. Thus, from 
Table 3.1, 


io? 


e s+1 - aaj-y6, 3/10 -2/5 
lal : s P= z 
1 3 2 Pe 


answer 
6 d 10 15 l i 
Further, consider the two following cases: 
Case 1 (unrepeated factor) 
A 
As in the previous example, if Y (s) = on has a fraction ——— , then from Table 3.1 


the inverse transform is Ae“, where 


ii TOFS FO (3.7) 
s>a Q(s) QO'(a) 


Case 2 (repeated factor) 
Am Am-1 Al 


—a)"  (s—ay™-1 ` (s-a) 
then from Table 3.1, using the first shifting theorem, the inverse transform is 


If Y(s) has a sum of m repeated fractions, 


m—1 m—2 
at | An + Am- A A 
e [An + m Gao pT 25+ Ji 
where 
m 
— P 
Ain EO (3.8a) 
sa Q(s) 
and the other constants are given by 
1 qim-k — aP 
A= lim BIO) | gad. G 
(m — k)! s>a ds™—k Q(s) 


P 3— 4s? 44 
Example 3.1.12. (repeated factor) Find %7! | Y(s)= © 3 s | 


Q(s)  s?(s — 2)(s — 1) 


OOOO. 
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P 3 4s? +4 A A B C 
Solution. We have (s) a2 ie = as 4 i 
Q(s) s2(s—2)(s—1) s? sS (s—2) (s—1l) 
The constants A» and A, are first determined where a = 0 (Case 2). Using Eq. (3.8a) 
with m = 2 gives 


| P8 — 4s? +4) 58-457 +4 
A2 = lim = 2. 
s>0 g2(s—2)(s—1) (s-2)(s-—1)|,-0 
From Eq. (3.8b) with m = 2 and k= 1, 
3_ 4s? +4 
E (s s~ + 4) = 
s>0 z (s—2)(s— 1) 
From Eq. (3.7), the constants B and C can be determined as 
P(s P(s 
= (s) =-l and C= (s) = 
s? (s —1) |,_> s*(s — 2) |s=1 
The inverse transform is y(t) ZUY )} Z7! + f ` 
D X = S = : 
4 2's G-J G=) 
which yields the final result: 
y(t) = 2t+3-—e% -— e. [answer]. 
P(s) 


Finally, as a generalization of Case 1, where Y(s) = 


and P(s) =a polyno- 


mial of degree less than n, Q(s) = (s — a1)(s — Q2)...(5 — œn), Where a1, 2, ..., Hy 
are all distinct, then 


O' (on) 


P(s)| LS POW) sagt 
zigo- Lg | 


3.1.6 Laplace inversion formula 


Using the theory of complex variables presented in Chapter 10, one can find the in- 
verse Laplace transform with the complex inversion formula for certain cases where 
the inverse transform is not listed in standard tables. 

If F(s) = 2 { f (t)}, then -27!{F(s)} is given by 


201 —~i00 


1 y+ioo 
fMO= al e'F(s)ds t>0, (3.9) 
F 


D 
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y+iT 


y-iT 


FIGURE 3.4 


Contour integral for the Laplace inversion formula. 


where s is a complex number, that is, s = x + iy. In practice, the integral in Eq. (3.9) 
is evaluated by considering the contour integral 


1 st 
— Ọ e F(s)ds, (3.10) 
2i C 


where C is the contour shown in Fig. 3.4 of the line AB and the arc BJKLA of a 
circle of radius R. If the arc BJK LA is represented by I and T = y R? — y?, then 
Eq. (3.9) can be given as 


1 y+iT 1 1 
lim al e' F(s)ds = lim af e F(s)ds — al e'F(s)ds}. 
R>o 200i y-iT Roo | 277i C 2ri Jp 
(3.11) 
If constants M > 0 and k > 0 can be found such that on T (where s = Re!”) 


M 
|F (s)| < Re’ (3.12) 


then the integral around T of e*’ F (s) approaches zero as R —> oo. In fact, condition 
Eq. (3.12) holds if F(s) = P(s)/Q(s), where P(s) and Q(s) are polynomials and the 
degree of P (s) is less than the degree of Q(s). 


OOOO. 
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Thus, for the case Pu Ir e*' F(s)ds = 0, Eq. (3.11) can be evaluated with the 
> 00 


use of the residue theorem (see Section 10.5). This theorem states that if f(z) is a 
function that is analytic inside a simple closed path C and on C, except for finitely 
many singular points z1, Z2,..., Zk inside C, then 


k 


$ fds = 201 XO Res f(z), (3.13) 
C 


j=1,z=zj 


where the integral is taken counterclockwise around C. The integral in Eq. (3.13) 
is simply equal to the sum of the residues of f(z). Thus, comparing Eq. (3.11) and 
Eq. (3.13), 


fA = > residues of e” F (s) at poles of F(s). (3.14) 
A pole is a value at which F(s) becomes infinite. For example, if 


1 PCs) 
Q(s)" 


a pole is a root of Q(s) = 0. For a simple pole (that is, one which is not repeated), the 
residue when s = a is a root of the denominator of Eq. (3.15): 


e* F(s) = e° 


(3.15) 


st P(S) 
Resa = lim (s — a)e* ——. (3.16) 
sa Q(s) 
An equivalent formula that is also useful with transcendental functions is 
P 
Res, = e r (3.17) 
dQ(s)/ds | 54 
When a pole is repeated m times, the residue is 
1 at P(s) 
Resa = ue s i 3.18 
Em EEK Da iz ane 


In fact, Eq. (3.16), Eq. (3.17), and Eq. (3.18) are identical to the results given in 
Section 3.1.5 (Cases 1 and 2). 


Example 3.1.13. Evaluate %7! | | by using the method of residues. 


1 
(s + 1)(s — 2)? 


Solution. Since the function satisfies the condition in Eq. (3.12), we have 


Zj 1 ol [~ eds 1 $ eds 
(s+ D(s—2)2 J  2riJy-iv (6+1(s—22  2xri Je (8 + D(s — 2)? 


eds 
z at poles s = — 1 and s =2. 


= 5 residues of ———__—__~ 
(s + 1)(s — 2) 


D 
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St —t 
The residue at simple pole s = —1 is sim, o0] ANE = = = S , where 
Eq. (3.16) has been employed. 


The residue at the double pole s = 2 is evaluated using Eq. (3.18): 


St , d est 
i T Ts al s Gz a = E F =| 


. [te*(s+1)-—e*(1) te” e” 
= lim = 3 
s>2 (s + 1)2 3 9 
1 et t 2t 
Then %7! | aed >= residues = > + a — F [answer] 
Example 3.1.14. (Process control) 
Consider the linear equation 
dx dx dy 
— — +bx =c— + dy. 3.19 
ae oa T eO ( ) 


In control theory y is called the input and x is called the response or output function. 
A process is made up of elements, each of which has input and output signals. When 
the output signal is used to modify the input to an earlier element of a series, the 
system possesses feedback. 


All signals become known as a function of time by solving the system of dif- 
ferential equations. Usually, however, only the relation between external signals is 
necessary. It is also easier to work with deviations of signals from their steady-state 
values. Therefore, the deviations (and all of their derivatives) will be zero at zero time 
and the Laplace transform of a derivative (Section 3.1.2) reduces to 


a| 


T | = s” F(s). 


For instance, the transform of Eq. (3.19) is 


d 
(s? + as +b)X (s) = (cs + d)Y (s) or X (s) = (=) Y(s) =G(s)¥(s), 
s4+as +b 
; cs+d : 
where the coefficient G(s) = eee EA called a transfer function. An output 
se tas+b 


transform X(s), accordingly, is found by multiplying the input transform Y(s) by 
the transfer function G(s). Suppose that a particular input is constant, y(t) = k with 
transform Y (s) = k/s (see Table 3.1). Then 


k(cs +d) 


ee ae 
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Finally x(t) can be found by inversion with the aid of partial fraction expansion or 
with standard Laplace tables. 


Y(s) = 1/s X(s) 


negative feedback - 


- 7 
FIGURE 3.5 


Block diagram with negative feedback. 


Moreover, in the case of feedback, the transforms of an input y and an output 
x are related by the block diagram in Fig. 3.5. In equation form, the direct relation 
between transforms is obtained from 


[Y (s) — H(s)X(s)] G(s) = X (s), 


whence 


eis. OO ae OO) 1 
“TT 6@Ho LCAN (+). 


Thus, similarly, the output response can be obtained by inversion of X (s). 


3.2 Fourier transform methods 


Fourier series are used for periodic functions. However, the method of Fourier series 
can be extended to nonperiodic functions with the use of integral representations. 


3.2.1 Fourier integrals 


The Fourier integral theorem states that if 


(i) f(x) satisfies the Dirichlet conditions (Section 2.5.6) in every finite interval 
(—L, L), and 
CO 
(ii) f | f(x) 
—o0o 


dx converges, then 


f@a= i’ [A(w) coswx + B(w) sinwx] da, (3.20) 
0 


ee = = 
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where 


Aw== fo f(x) coswx dx, 
A (3.21) 


Bo f f(x) sinwx dx. 


The Fourier integral expansion of f(x) in Eq. (3.20) holds if x is a point of con- 


tinuity of f(x); otherwise if x is a point of discontinuity, f(x) is replaced by 
f(x +0) + f(x — 0) 
2 


. The above conditions are sufficient but not necessary. 


The similarity of Eq. (3.20) and Eq. (3.21) with corresponding results for Fourier 
series is apparent. 


Equivalent forms of Fourier integrals 
Substituting Eq. (3.21) into Eq. (3.20) gives 


1 OO [0,6] 
f@= >f f f(v){ cos wv cos wx + sinovsinox} dv dæ. 
T Jw=0 J v=—00 


Therefore, 
1 lo) CO 
f@= f f f(v) cos@(x — v)duda. (3.22) 
T Jw=0 J v=—00 
eio(x—v) 4 eTe xv) 
Since cos w(x — v) = , Eq. (3.22) becomes 


2 


ff) = =f fœ) if, (es + ei) do| dv 


1 œo o 0 ; 
f(v) p e2) do+ f e2) do| dv. 
w: 


27 v=—00 =0 w=— 00 


Therefore, this latter result can be written as 


= J / fE dudw 
27 J- J—oo 


ie a (3.23) 
=— ew |f fete dv do. 
27 Joo —oo 


OOOO. 
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3.2.2 Fourier transforms 
From Eq. (3.23), it follows that if 


F(a) = zÍ. foe dv, (3.24) 
then 
fœ)= = T. F(a)e'®* do. (3.25) 


The function F(w) = F { f (x)} is called the Fourier transform of f(x). The function 
f(x) is the inverse Fourier transform of F (œw), that is, f(x) = a {F(o)}. 


Example 3.2.1. Find the Fourier transform of the function in Fig. 3.6: 


if |x| <a, 


1 
sœ=] 0 if |x|>a. 


f(x) 
1 
x 
-a a 
FIGURE 3.6 
Square wave function. 
Solution. From Eq. (3.24), 
1 oo , 1 a , 1 e tov a 
F(o)= — | (vje '° dv = —— (le '?" dv = —— — 
~V 2T J—co f V20 J-a vV2m 0 |_a 


1 eid _ gina 2 sinwa £0 i 
= =a , Oo ; answer 
Wan iw I Oo 


ee = = 
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/2 
For w = 0, F(@) = ,/ —a. 
T 


Example 3.2.2. Evaluate the definite integral | 


65 w 


© sin wa cos wx a 


1 99 ‘ 
Solution. From Fourier’s integral theorem, if F(w) = a | f(v)e™® dv, then 
T J—oo 


1 2 ; 
(x)= = Faye da. 
f AV 27 =00 
1 if 
1 f>” [2sinoa ,o, w Bea, 
From Example 3.2.1, Ton ee dw = 5 if |x|=a, 
= T 
ZT id= i 0 if [xl>a. 
The left-hand side of the above equation is equal to 
1 f° sinwacoswx i (°° sin@wasinwx 
dw+ dw 
m J- w m J_oo w 
The second integral is zero since the integrand is odd. Thus, 
Si a x if |x|<a, 
sin wa COS Wx , 
f ————dwo=4 5 if |x|=a, 
Tn a 0 if |x|>a. [answer] 
°° sinwa 


dw =x or 


Moreover, if x = 0 and a = 1, then the result yields f 
w O 
f © sin wa T 
— dw=—. 
0 w 2 


Fourier cosine and sine transforms 
2 [0.6] 

If f(x) is an even function in Eq. (3.21), A(@) = =f f(x)cosæxdx and 
T JO 


B(w) = 0. Hence, Eq. (3.20) yields the Fourier cosine transforms 


Fco(@) = er f(x) cos wx dx (3.26) 


and 


f@a= ia Fc(@) cos œx do. (3.27) 
T JO 


a 
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2 (oe) 
Likewise, if f(x) is an odd function, A(w) = 0 and B(w) = — I f(x) sinwx dx. 
T Jo 


Hence, Eq. (3.20) yields the Fourier sine transforms 


Fs(w) = Er f(x) sin wx dx (3.28) 


f@= oe Fs5(@) sinwx dw. (3.29) 
T Jo 


3.2.3 Special theorems on Fourier transforms 


Some important results involving Fourier transforms (and corresponding inverse 
Fourier transforms) are detailed below. 


and 


Linearity 
If a and b are constants, 


F {af (x) + bg(x)} =a F {f x) + bF{g(x)}. (3.30) 


Fourier transform of derivatives 


Fourier transforms are useful for solving linear differential equations, and therefore 
it is useful to find Fourier transforms of derivatives. 


Let f(x) be continuous and f(x) —> 0 as |x| — oo. Furthermore, let f'(x) be 
absolutely integrable on the x axis. Then 


FIF A =ioF { f(x)} (3.31) 


and for higher derivatives 


Ff" (x)} = —@ Ff (x)}- (3.32) 


Cosine and sine transforms of derivatives 


Let f(x) be continuous and absolutely integrable on the x axis. Also let f’(x) be 
piecewise continuous on each finite interval, and f(x) —> 0 as |x| —> oo. Then 


2 
Fei f'E) =oF s{ f(x)} - [2 ro, 33) 
Fs f' (= — FA f} 


and for higher derivatives 


re = 
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2 
Fel f" A =- wo Fel f (x)} - Ero. 


5 (3.34) 
Fsi f" =- Fsi fE) + oro. 
Convolution theorem 
Define the convolution f * g of functions f and g as 
(f * g)(x) =| Sf (p)g(x — p)dp = f(x — p)g(p) dp. (3.35) 


If f(x) and g(x) are piecewise continuous, bounded, and absolutely integrable on the 


x axis, then 


Fi f x g)}= VF {fi Fgh. 


Taking the inverse transform of Eq. (3.36) yields 


(f *2)(x) = J F(w)G(w)e"* do. 


(3.36) 


(3.37) 


Fourier series and integrals are important tools in solving boundary value prob- 
lems as shown in Chapter 5. 


Table 3.2 Special Fourier transforms. 


fœ) 
1 if |x| <a, 
0 if |x|>a 
x 


x" f(x) 


Fl f (x)} = F(w) 


/2 sinaw , 
m w 


f(x) 
1 
x2 +a? 


fO) 


f(ax)el* 


F{f(x)} = Fw) 
EZ 
2 a 


i” w” F (w) 


1 (=) 
-F 
a a 


a See Example 3.2.1. 


Special Fourier transform pairs are listed in Table 3.2 for the given definition 
in Eq. (3.24) and Eq. (3.25) as follows from Spiegel (1973). The listings differ 


slightly as a different normalization constant ( 


1 
J 20 


) was used in Spiegel (1973), 


such that A{f(x)} = F(w) = ie fœ dx and F-"{F(w)} = f(x) = 


1 f” ; 
_ i F(w) el wx 
2 J ce 


dw. 


E 
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Table 3.3 Fourier sine transforms. 


f(x) Ff f (x)} = Fs (w) f (x) Fs{ f (x)} = Fs(w) 
1 0<x <a, 2 (1—cosaw “4 T 
fait (gramimatciateeeciee x J£ 
0, x>a T w 2 
x m e71 eax 2 w. 
x2 +a? yV 2 Va (w? +a? 
cane {2 T(n) sin(n tan! 2) Me J2 a y? 
m (w2 +.a?)n/2 i 4a3/2 
1 n—1 nn 
sone ae on TU RED Ganza 
Jw 2 l(a) 


w <a, 


1x me” 
w=a, tan — = 
a 2 w 


w>a 


n 
IT 
0, 
V27 /4 
V1 /2, 


E tanh (32) 1 
2 a ex —] 


Fourier sine and cosine transform pairs are also listed in Table 3.3 and Table 3.4, 
respectively, given the definitions of Eq. (3.26) to Eq. (3.29). Again, account has 
been taken of the fact that the normalization constant used in Spiegel (1973) for the 
transform definitions differ. 


sinax 1 l w+a sinax 
x A2 w—a x2 


CSC ax 


foe) 
Fourier sine transforms: We have ¥5{f (x)} = Fs(w) = I f(x) sinwx dx and 
2 f° i 
FHF (w) = fœ) = zf F;(w) sin wx dw. 
T JO 
[0,0] 
Fourier cosine transforms: We have ¥¢{ f (x)} = Fe(w) = f f(x) cos wx dx and 
0 


F—"(F,w)) = f= z A EE 
0 


3.3 Discrete Fourier transforms and fast Fourier transforms 


The discrete Fourier transform (DFT) is equivalent to the continuous Fourier trans- 
form in Section 3.2 but instead relies on an individual sampling of a finite sequence of 
data where each value is separated by a sampling time T. This method is very useful 
in a wide number of fields that include, for instance, engineering, science, mathemat- 
ics, and music. For example, DFT applications are used in the processing of digital 


ss =. ._._._.LdrlC OER 
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Table 3.4 Fourier cosine transforms. 


f (x) FA f (x)} = Fe(w) fœ) Fel f(x)} = Fe (w) 
1, O<x<a, = 1 Et 
0, x>a m w x? +a? 2 a 
ent Co tl eax (22 cos(n tan! a 
m w +a? x  (w2ta2yn/2 
2 n=l g ni 
ena en aa x7” LE a seol A O<n<l 
2a 2 r(n) 
in L i x2 4 a2 J2(e7 Pv _ ea) 
Jw x2 + b? w3/2w 
; J /2, w <a, i 
sinax ; 2 w2 . w? 
D vV27/4, w=a, sinax ia (cos aq 7 sin ra) 
w>a 
a= 1 2 
sechax p eo cosax? aa (cos 7; +sin Z) 
h 2 h(/Tw/2 -ayx 1 — 
eigen) ostili 2) ae, —= (cos(2a,/w) — sin(2a./w)) 
cosh(./7x) cosh(./7 w) JX Jw 


signals such as for sound or radio waves that are collected over a finite interval of 
time. It is also used for the processing of images where pixels are used to raster an 
image or for the solution of partial differential equations with Dirichlet and Neumann 
boundary conditions. In addition, it can be applied for more precise mathematical op- 
erations such as for convolutions or with the multiplication of large integer values. 
To improve the computing speed of the DFT analysis, a fast Fourier transform (FFT) 
method is able to factor the resulting DFT matrices into very sparse matrices. This 
approach results in an enormous increase in speed in the analysis of long data sets. 
The method is also able to avoid round-off errors for more accurate evaluations. 


The following discussion is based on the analysis in Oxford (2020), Smith (1997), 
and Press et al. (1986). 


3.3.1 Discrete Fourier transforms 


With a continuous Fourier transform in Section 3.2, a Fourier transform of an original 
signal f(t) with a suitable normalization is defined by 


F(w)= =|. Fie dt. 


Consider the case of a sampling of data f (k) that gives rise to N individual values of 
fiO), FUL), fE], ...., FLV — 1] that are separated by a sample time T. One can think 


OOOO. 
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of each sample as an impulse with an area f[k], with a transform integral that now 
has finite integration limits. The integral can be further replaced by a summation over 
the sampling points: 


(N-1)T 
F(w) =f f(the i dt 
0 


= f(e. + fle? +... + fikle i? +... f[N — Me tet 
N-1 

= = fine. 
k=0 


With only N data points, there are only N significant outputs. A continuous Fourier 
transform is evaluated over the integral limits of —oo to oo for a periodic func- 
tion. Similarly, with only a finite number of data points, the DFT treats data in a 
periodic manner as well, where the interval f(N) to f(2N — 1) is identical to the 
sequence f(0) to f(N — 1). The fundamental frequency for one cycle per sequence 


: TT : . IS 
is w = — Hz or w = —~ rad/s, including w = 0 (for a nonoscillating or average 


component of the signal), as well as higher-order harmonics. 


Hence, in general, the DFT F[n] of the sequence f[k] is defined as (Oxford, 
2020) 


Fin}= >> fikei forn=0,., N1. (3.38) 


F[0] 1 1 1 1 f [0] 
F[1] 1 wW w2 wN-! fU] 
F[2 — w2 w4 w-2 f j, 
rin- La w w ow PL pw 


where W = e~!27/ and W = W2". An inverse transform, as analogously defined 
for the continuous transform, is 


N 


(= 1 2 nk 
5 F[n]i T" fork =0,...,N—1, (3.39) 
n=0 


fk] = 


z| 


where F[n] coefficients are complex and the resulting spectrum is symmetrical about 
N /2. The inverse matrix is therefore derived as 1/N times the complex conjugate 
of the original matrix. For the inverse transform, the terms F[n] and F[N — n] give 


ss O 
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rise to two frequency components of which only the lower frequency component at 


2 
n: ts Hz for n < N/2 is valid, while the other one is extraneous for n > N/2. 


This latter component is termed an aliasing frequency and is an artifact of the signal 
processing that can be avoided by applying a low-pass filter. Thus, for these two 
contributions, one has 


1 » LN . LI 
falkl = [Fine Er + FIN —nle! aoe (3.40) 


where for f[k] real 


N-1 
F[N C n] = f [klei iw N-k 


(3.41) 


: : 27 
= fik] e7 i2rk ei wink = F*[n], 
k=0 =1Vk 


where F*[n] is the complex conjugate. Thus, substituting, Eq. (3.41) into Eq. (3.40), 
noting that e'** = 1, and applying Euler’s formula for the complex exponential, one 
obtains 


=— {Fin Jei wrk q p* [n le ae 


= 5; {Retin |} cos Z ik —Im{ Fn nj} sin snk (3.42) 


= 5, firin ios | (2) kT + arg Find |. 


2 
This formula gives a sampled sinusoidal wave at a frequency of = and a signal 


ae See 


2 
amplitude of N |F[n]|. 


Example 3.3.1. Given a continuous signal that is composed of one nonoscillating 
component and two oscillating components, f (t) = 3+ 2cos(2mt — 1/2) + cos4zt, 
evaluate the DFT for this signal sampled at four times per second (that is, 4 Hz) from 
t = 0 to t = 3/4. 


k 
Solution. Putting the time t = kT = —, the values for the discrete sampling of the 


continuous signal in Fig. 3.7 become 
T ud 
flkl =3+2cos (Zk — ~) +coswk. 
` 2 2 
Thus, with N = 4 


T 
f{0] = 3 +2 cos (-5) + cos(0) = 4, 


OOOO 
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fik] 


k(=4t) 


FIGURE 3.7 


Continuous signal and individual components of the signal. 


fU] =3 + 2cos(0) + cos(x) = 4, 
f[2] =3 + 2cos (5) + cos(27) = 4, 


3m x 
fB] =3+2cos (F — =) + cos(3z) = 0. 


From Eq. (3.38), with N = 4 


3 3 
Fin = Y fike E = Y fe. 
k=0 


k=0 
Hence, 
FOJ pi i 1 LP sory [2 
PMi =i i AA 
FR| Ti =i 1 ilal | “a [answer] 
FBI} Lio i =i -ij fB] L4 


Example 3.3.2. For Example 3.3.1, evaluate the contribution of f[k] from the in- 
verse transform results for F[0], F[1], and F[2]. 


Solution. 
First component: Since F[0] = 12, from Eq. (3.39) as a special case that n = 0, 
1 


12 
folk] = go = 7 = 3 (that is, a constant). [answer] 
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Second component: Since F[1] = —4i = F*[3], the peak amplitude for the funda- 
2 2 
mental component fı[k] is g FU = — x 4=2. The phase for —4i from polar 


4 
coordinates for this point in a complex x-y diagram is arg(F[1]) = — iN Hence, from 
wG oes ar aak l | 
. (3.42), =2cos | —kT — — | =2cos(—k— —). [answer 
É i NT 2 a "9 


N 
Third component: Since F[2] = 4, where n = 7’ the second term in Eq. (3.40) is 
1 ; 27 
extraneous so that there are no N — n components such that fo[k] = N F[2]e' Fk 
e'™* — cos xk since sin zk = 0 for all k. [answer] 
As expected, all f,[k] (for n = 1,2,3) match the three components for the discrete 
sampling of the continuous signal f[k] in Example 3.3.1. 


In addition to aliasing where different signals can become indistinguishable when 
sampled, spectral leakage can also occur when a noninteger number of periods of 
a signal is transformed. This phenomenon results in a spread of the signal among 
several frequencies after the DFT analysis. 


Boundary value problems 


The DFT technique is also used to numerically solve boundary value problems for 
partial differential equations in Chapter 5 and Chapter 6. For instance, consider heat 
conduction in a flat plate as described by a Poisson partial differential equation, 
V?u = f(x,y), in Eq. (6.13). To solve this problem, one can apply a discrete in- 
verse Fourier transform with a double sum for the temperature in the plate u(x, y) at 
the position x and y with an equivalent relation to Eq. (3.39), such that (see Press et 
al., 1986) 


1 J-1L-1 
~ —2nijm/J „—2riln/L 
uj) = ra > 5 ûmne xijm/ ent n/ : 


m=0 n=0 
Here the function u(x, y) is represented by values at the discrete set of points x; = 
xo + jh, j =0,1,..., J, and y = yo + lh, l = 0, 1, ..., L, and h is the common grid 
spacing. Similarly, 


J—1 L-1 


1 Ke aa aes 
fi= T > 5 fnne 2nijm/J ,—2niln/L 


m=0 n=0 


Note that the nomenclature of Press et al. (1986) is adopted in this derivation, which 
differs only with a sign change in the exponential for the transform pairs. One just 
needs to be consistent on taking the transform and its inverse, depending on which 
definition is followed. Substituting these expressions into Eq. (6.17), one obtains 


Rinn (onan 4 eo 2ninfs + e2tin/L + eo 2rin/L -4) = taal. 


n 
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This latter expression can be simplified and solved for tiny: 


a Ênnh? 
RHETT 2am 2rn ` 
2 | cos + cos 2 
J L 


Thus, for the solution of the heat conduction equation, one computes Fan as the 
Fourier transform 


J—1 L-1 


F 2nimj/J „2rinl/L 
finn = 5 > fje INe E 


j=0 1=0 


This transform is used in the expression for ûmn and then an inverse transform of this 
subsequent result is taken to obtain the final solution for u j;. This analysis is only 
applicable for periodic boundary conditions: u jj = uj+y,1 = U j +L- 


However, the methodology can be easily extended to a Dirichlet boundary condi- 
tion, where u = 0 on the rectangular boundaries. In this case, 


J-1L-1 r 
j= tt sin sin — 
“=F E 


m=1n=1 


and one computes analogously the sine transform for (ae 


J-1L-1 


7 _ ajm , ain 
= j} SiN sin ——-. 
Finn 2 di J L 


This expression is used in the similar expression for ûmn, 


: funk” 
Umn = rm rn ; 
2 (cos Z + cos Z2 -2) 
J L 


Again, an inverse sine transform is applied to the resulting expression for tm, to 
obtain the final solution for u j;. 


In the case of an inhomogeneous boundary condition, such that u = 0 on all 

boundaries except at x = Jh, where u = g(y), one simply adds the above solution 
Pu a? 

to the solution for the homogeneous equation V7u = x T > = 0 that satisfies 
x y 

the required boundary condition. For instance in the continuum for a flat plate from 

Example 5.2.6, the solution is given by Eq. (5.28): 
H nx . nity 


u~ = An sinh —— sin ——. 
Jh Lh 
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Here A, is found by imposing the boundary condition such that u = g(y) at x = Jh. 
Analogously, for the discrete case 


where A, is obtained from the inverse formula 


tL = nol 
An = sin ——, 
"~ sinh sn 28 L 


where g; = g(y = lh). The complete solution is 
u=uji+ ull. 


Algorithms using FFT methods (see Section 3.3.2) are given by Press et al. (1986) 
to efficiently solve these transforms numerically. 


3.3.2 Fast Fourier transforms 


The number of multiplications to evaluate a DFT grows significantly with the length 
of a transform as N?, resulting in a loss of computational speed. This result occurs 
due to the matrix and vector multiplication shown for the matrix form of Eq. (3.38) 
in Section 3.3.1. The FFT method was developed, subsequently, recognizing and 
eliminating the redundancy in the DFT calculations. Here F'[n] in Eq. (3.38) can 
be rewritten as 


: 27r 
where wrk =e 'w"* Tt is recognized that wrk is evaluated many times and is peri- 
odic, as demonstrated in Example 3.3.3 for N = 8 (which is an integral power of 2). 


Example 3.3.3. Evaluate wk for N = 8 and k = 0 to 7. 


On l—i 
a! 


Solution. We have We =e! 
/2 


—ai = —a*, a* = —1, a = —a, af =i, a’ = ai =a* and a = 1. Hence, it can be 
seen that We = —Wọ, wẹ = —W4, we = Wẹ, and wg = —W;3. [answer] 


=a. Thus, it follows that a? = —i, a? = 


To take advantage of this symmetry and periodicity, one splits the N samples into 
two summations of size N /2, with one summation having an even value for k and the 
other an odd value, so that 


N/2-1 N/2-1 
Finj= D> flamwe + Y fam + wy”. 


m=0 m=0 
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: _j 2a —i 2% : 
Given that We =e iW mn) — gt wpm — W/o one obtains 


N/2-1 N/2-1 
Flnl= D> fl2m|Wwiy+ Wy Do fl2m + Wx" has) 
m=0 m=0 ` 


= G[n]+ W” Hin]. 


Consequently, the N-point DFT can be split into two single N /2-point transforms, 
where one contains even input data G[n] and the other odd input data H[n]. Even 
though the frequency index n ranges over N values, only N/2 computations are 
needed as both G[n] and H[n] are periodic in n having a period N /2. This results in 
important computational savings, as demonstrated in Example 3.3.4 for N = 8. 


Example 3.3.4. For the case of N samples containing even input data f[0], f[2], 
f[4], f[6] and odd input data f[1], f[3], f[5], f[7], evaluate the N-point DFT. 


Solution. We have 


F [0] = G[0] + We A[0}, 

FU] = G[1] + Wg H[1], 

F[2] = G[2] + Wg H[2], 

F[3] = G[3] + We H[3], 

F[4] = G[4] + Wg H[4] = G[0] — WÊ H{0], 

F[5] = G[5] + We H[5] = G[1] — Wz H[1], 

F[6] = G[6] + We [6] = G[2] — We H{[2], 

F{[7] = G[7] + W, H{7] = G[3]— Wẹ H[3]. [answer] 


This calculation is depicted in the flow diagram of Fig. 3.8. The basis of the FFT 
calculation is a butterfly computation shown in Fig. 3.9 because of the criss-cross 
nature of the calculation. 


With N as a power of two, the process in Fig. 3.8 can be repeated with a break- 
down of the N/2 signals into N /4 transforms, and so on (see Fig. 3.10). Each stage 
uses a so-called interface decomposition, which enables a separating of the even- 
and odd-numbered samples. There are log, N stages required for the decomposition. 
For example, in Fig. 3.10, for the eight-point signal (23), three stages are required. 
Similarly, a 16-point signal (2+) requires four stages, a 512-point signal (27) needs 
seven stages, a 4096-point signal (2!*) involves 12 stages, and so on. As previously 
mentioned, a DFT analysis requires N* complex operations. On the other hand, there 
are N/2 complex multiplications to combine the results from the previous stage and 
log, N stages so that (N/2) - log) N operations are needed to evaluate an N-point 
DFT with the FFT. This results in a considerable saving in the computational time. 
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fo) | G[0] F[O] 

f2l | np G2] F(1) 
point 

fa _ | OFT GI4] F[2] 

fi6] c6 \ X X /_ Bl 

fl) Ma) 7 X X_ XK_\ lal 

f3] ni Ma) #7 AAA 
point 

f5] DFT HIS] ; ’ _ F{6] 

A7) H) FU] 


FIGURE 3.8 


FFT analysis for an eight-point sampling separated into two four-point transformations. 


A A+WiB 


FIGURE 3.9 


Basic butterfly operation in an FFT. 


For instance, for N = 256, the DFT would require 65,536 multiplications compared 
to only 1,024 with the FFT. For N = 1024, the DFT requires a very large number 
of 1,048,576 multiplications compared with a much smaller number of 5,120 for the 
FFT. 


The time domain decomposition is usually performed using a bit reversal sorting. 
Here there is a reordering of the N time domain samples employing a binary counting 
with the bits flipped left-for-right (see Fig. 3.11). This process allows the algorithm to 
effectively decimate the sequence of input samples at each stage into separate even- 
and odd-indexed samples. 
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1 signal of 8 points 


2 signals of 4 points 


4 signals of 2 points 


8 signals of 1 point 


FIGURE 3.10 


FFT decomposition where an N-point signal is decomposed into N signals, each containing 
a single point. 


Input data DFT 
fA _ F{O] 
N/4 
point 
fal __ FL 
Index | Binary Bit- Bit- 
No. (k) | No. sed sed 
aes o ra 
200 000 o point 
2 fis] F{3] 
3 
4 1 = — 4 
: fi) N/A F[4) 
6 point 
7 fi5] F[S] 
£13] J wa | F[6] 
point 
An __] F[7] 


FIGURE 3.11 


Evaluation of the bit-reversed counting of N = 8 input data separated as an N/4 DFT. 


Thus, in the FFT process N-point time domain signals are decomposed into N 
separate time domain signals, each with a single point, as shown in Fig. 3.10. The 
next step involves a calculation of the N frequency spectra using the corresponding 
N time domain data. The N spectra are then synthesized into a single frequency 
spectrum. The algorithm for this analysis is shown in Fig. 3.12. 


Problems 


Time Domain Data 


- 4 Time domain 

Bit Reversal decomposition 
Data Sorting 

H| >= 

u tY- 

O| o 

Al £ 

Ss =] 

al a 

Ş § Butterfy L Frequency 

o o Calculation domain 

ol 6 i 

2 2 synthesis 

ol o 

S| 3 


Frequency Domain Data 
FIGURE 3.12 


Algorithm for the fast Fourier transform. 


Problems 


3.1 Find #7! | ae | using the property for differentiation of a transform. 
s4—a 
cosat — cos bt 


3.2 Find z 7 


| using the property for integration of a transform. 


3.3 Find Z7! | 7] | using the convolution theorem. 
s4— 
e Ts 


(s? + 1) 


3.4 Find the inverse Laplace transform 77! | using the second shifting 


theorem, and sketch the resultant function. 


3.5 Find the inverse Laplace transform -%7 ! | C-i | using the property for the 
s2 — 


differentiation of a transform. 


aiT 2 

3.6 Find the inverse Laplace transform 2 —+—— ]ņọ using the con- 
s2 s?+1 

volution theorem, where I" is the gamma function. 

0 if —l<x <0, 

2 if O<x <1. 

(a) Sketch this function and write it in terms of Heaviside step functions. 

(b) What is the Laplace transform of f (x)? 


3.7 Consider the function f(x) = | 
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3.8 Find the Fourier transform ¥ { f (x)} by direct integration, where f(x) is given 
e~ if x <0, 

by Fs) =| 0 if x>0. 

3.9 Show that the Fourier sine transform pair as given in mathematical handbooks 

(for example, Spiegel, 1973), 


l(a) . an 
sin —, 
a 2 


f@)=x*! O<a<1) and Fw) =F (¢0)= 2 


Ww 


reduces to the following transform pair for an appropriate choice of a: 


1 1 
=— and F(w)= F = —. 
f(x) ce (w) {fa} Tar 
3.10 Determine the particular solution for the nonhomogeneous, second-order, or- 
dinary differential equation y” + y’ — 2y = 4 sin 2x using a Fourier transform method 
(compare your answer to the solution in Problem 2.15(a) and (b)). 


=(x/a)? 
if it is centered 


3.11 Show that the normal distribution reduces to f(x) = 


at the origin, where a = 20. The maximum of this function occurs at f (0) = 


1 
asi’ 
where a is the width of the curve at half the maximum value. 
(a) Show that the Fourier transform of this function is F(w) = F {f(x)} = 
1 
ew by This function is also a Gaussian function, where b = 2/a is the corre- 


J 20 
sponding width of the transformed curve at half-maximum. 


(b) Recover f(x) by evaluating the inverse Fourier transform F a {F (w)} for the 
function F (w) given in part (a). 


—x/a 


—e for x > 0, 


1 
3.12 Given the function f(x) = -e7*l/4 = í 
s a@l@ for x<0. 
a 


(a) Show that the Fourier transform of this function f(x) is F(w) = 


ie 1 
x aal 


(b) Recover f(x) by evaluating the inverse Fourier transform Z TI{F (w)} for 
the function F (w) given in part (a) based on a contour integration method (see also 
Chapter 10). Hint: Consider the contour integral 


dz, 


1 eizx* 
a (+5) E— 7) 


where the contours C4 and C2 are given in Fig. 3.13(a) and Fig. 3.13(b) for x > 0 and 
x < 0, respectively, in the limit that R — oo. 


Problems 
(Contour y 
y for x <0) | 
(Contour R +R 
for x > 0) Ci — x 
x 
3 R 
R + © 
(a) (b) 
FIGURE 3.13 


Contours for the integral (a) for the case that x > 0 and (b) for the case that x < 0. 


1 
3.13 Consider the Laplace transform F(s) = a 
y— 


M 
(a) Show that this transform satisfies the condition |F (s)| < Re in Eq. (3.12) by 
finding the values of M and k. Hint: Given the relation |a + b| < |a| + |b|, letting a = 
1 


zı — z2 and b = 2), it follows that |z1 — z2| > |z1| — |z2| or 


< i 
l |zı —z2| [zi] — [zal 
Using this latter relation, with z1 = s = Ret? and z2 = 2, one obtains the required 
1 


result: | —— | < —. 
s—2 R-2 


St 


(b) What is the residue of < 


sc 


5 at the simple pole s = 2? 


1 
(c) From parts (a) and (b), calculate # ral {=} using the complex inversion 
s 


formula. Compare your result to that given in Table 3.1 for a = 2. 


3.14 Solve the system of chemical kinetic equations in Example 2.4.3 using Laplace 
transforms. 
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CHAPTER 


Matrices, linear systems, 
and vector analysis 


This chapter describes the evaluation of the determinant and the calculation of the in- 
verse of a matrix. It also describes a solution method for systems of equations, which 
can be solved in a matrix form. This approach becomes important in numerical meth- 
ods for solution of partial differential equations, as detailed in Chapter 5. Moreover, 
matrix procedures and solution of systems of equations can be applied with the Maple 
software package, as further detailed in Appendix A. 

In addition, this chapter further details vector analysis, which is concerned with 
the differentiation and integration of vector fields. Vector fields arise in physics and 
engineering disciplines, particularly in the description of electromagnetic and gravi- 
tational phenomena as well as fluid flow. 


Definition of a matrix 


A matrix of order m x n is a rectangular array of numbers having m rows and n 
columns. It can be written in the form 


d&il 412 >> Ain 

a a eae a 
A= 21 22 2n 

amli Am2 °*** Amn 


Each number ajx in this matrix is called an element. The subscripts j and k indicate 
respectively the row and column of the matrix. 


The transpose of a matrix, denoted by A’, is obtained by interchanging the rows and 


columns of a matrix A, that is, if A = (a jg), then Al = (akj) Or 


4j] 421 +++ Aml 
T 412 422 `° Am2 
An 
Ain 42n ``’ Amn 
Advanced Mathematics for Engineering Students. https://doi.org/10.1016/B978-0-12-823681-9.00012-5 1 1 1 
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4.1 Determinants 


If the matrix A is a square matrix, then the determinant of A of order n is given by 


áil 412 vst Alin 
a a eee a 

D=detA =|" “2 a) (4.1) 
Ami Am2 °*** Amn 


The value of det A can be obtained as follows. 


(i) Given any element a jg of D, a new determinant of order (n — 1) can be obtained 
by removing elements of the jth row and kth column, called the minor M jx. For 
example, the minor corresponding to element 5 in the 2nd row and 3rd column 
of the fourth-order determinant 


3 0 2 
1 -1 2j, 
—2 4 #1 


which is obtained by removing the shaded elements. 
(ii) Multiplying the minor of ajg by (—1)/+*, the result is the cofactor of ajx ex- 
pressed as C jz = (—1)/ tk M jk. For example, the cofactor of element 5 is 


3 0 2 
C3 =(-1® |1 -1 2l. 
=2 a i 


The value of the determinant is then defined as the sum of the products of the el- 
ements of any row (or column) by their corresponding cofactors and is called the 
Laplace expansion, 


n 
det A = par ers (4.2) 
k=1 
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Example 4.1.1. (Determinant of second order) 
We have 


ait 412 
a21) 422 


D=detA= 


For (j = 1) the first row is D = aj1a22 + a12(—a21) = 411422 — a12a21. The same 
value is obtained using the elements of the second row (or first and second columns). 


Example 4.1.2. (Determinant of third order) 
We have 


4i] 412 413 
D = det A = |a an anl. 
431 432 433 


The cofactors of the elements in the first row are (j = 1) 


1+1 |422 423 
Ci =(-)) = 412433 — 423432, 
a32 433 
1+2 |421 a23 
C2 =(-1) = —(a21433 — 423431), 
a31 433 
1+3 [a21 an 
C3 = (-1) = A421432 — 422431. 
a31 a32 


Hence, D = a11C11 + a12C12 + a13C13 = a11 (a22a433 — 423432) — a12(a21433 — 
473031) + a13 (421432 — 422431). 

The same value is obtained by using the elements of the second or third rows (or first, 
second, and third columns). 


Example 4.1.3. (Determinant of a triangular matrix) 
The determinant of any triangular matrix equals the products of all the entries on the 
main diagonal. For example, 


-1 


Theory on determinants 
Theorem 4.1. We have det(A) = det(A’). 


Theorem 4.2. The value of the determinant remains unchanged by multiplying the 
elements of any row (or column) by a given number and adding the corresponding 
elements of any other row (or column). 


Theorem 4.3. [fall elements of any row (or column) are zero except for one element, 
then the value of the determinant is equal to the product of that element by its cofactor. 


EEE 


114 CHAPTER 4 Matrices, linear systems, and vector analysis 


Theorem 4.4. If vi, v2, ..., VUn represent row vectors (or columns) of a square ma- 
trix A of order n, then det(A) = 0 if and only if there exist constants (scalars) 
1, A2, «++, An not all zero such that 


Ayvy tAgve +... + ÀnUVn = 0, (4.3) 


where zero is the null row matrix. If Eq. (4.3) is satisfied, the vectors are linearly 


dependent. Otherwise, they are linearly independent. A matrix A such that det A = 0 


is called a singular matrix . If det A 4 0, then A is nonsingular. 


In practice, a determinant of order n is evaluated by using Theorem 2 successfully 
to replace all but one of the elements in a row (or column) and then using Theorem 3 
to obtain a new determinant of order n — 1. This procedure is continued in this man- 
ner, arriving ultimately at determinants of order 2 or 3 which are easily evaluated. 


4.2 Inverse of a matrix 


If A is a nonsingular square matrix of order n (where det A Æ 0), then there exists a 
unique inverse matrix A~! in the following form: 


C T 
=| = ( jk) À (4.4) 
det(A) 
where (Cx) is the matrix of cofactors C jg and (Cx)? = Cx; is its transpose. 
-1 —2 4 
Example 4.2.1. Find the inverse of A=] 2 -5 2 
3 —4 6 
Solution. Using Eq. (4.2), det A = — 1 (38) — 2(18) + 4(7) = —46. Moreover, 
-5 2 — 4 4 
TE 7-38 c= [4 4 —28, cu =| 5 ma 
2 2 -1 4 -1 4 
cz=-h 2 a8 cn= 4 —6, C32 = ie j= 10. 
2 —5 -1 —2 -l -2 
cn= [5 3 ca=-|; Z4|=-10, cn=[5 “)=2 
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Using Eq. (4.4), the inverse matrix is therefore 


19 14 8 
23 23 23 
38 —28 16 
1 
Poe 18 -6 10|= -2 2 ee : [answer] 
7 5 9 
46 23 46 
Check the following inverse matrix: 
19 14 8 
23 23 23 
-1 —2 4 1 0 0 
Mato ag oie? 2. 2 belo lA A 
3 -4 6 23 23 23 0 0 1 
7 5 9 
46 23 46 
4.3 Linear systems of equations 
A set of equations having the form 
aux + anx? + .. + Gintn=bi, 
a T ae Te laa? “A eee (4.5) 
AmıXı + Am2X2 + ... + AmnXn = bin 


is called a system of m linear equations in the n unknowns x1, x2..., Xn. If b1, b2..., bm 
are all zero, the system is called homogeneous. If they are not all zero, it is called 
nonhomogeneous. Any set of numbers x1, X2..., Xn, Which satisfies Eq. (4.5), is called 
a solution of the system. 


In matrix form, Eq. (4.5) can be written as 


x b 

ai} 412 `‘ Gin l A 

X2 2 

a a © a 2 
e a Re ee (4.6) 

Aml Am2 *** Amn Xn bm 
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or in a vector equation, 


Ax =b. (4.7) 


Solution of systems of n equations inn unknowns 


(i) Inverse method. 
If m =n and A is a nonsingular matrix so that A~! exists, Eq. (4.7) can be solved by 


x=A7!b, (4.8) 


and the system has a unique solution. 
Example 4.3.1. Solve 


—x, —2x2+ 4x3 =-—3, 
2x, — 5x2 + 2x3 = 7, 
3x, —4x2— 6x3=5. 


-1 -2 4 x1 =3 
Solution. We have | 2 —5 2 x2}=] 7 
3 —4 —6] | Xn 5 
19 14 8 
23 23 23 
. -1 9 3 5 ; 
From Section 4.2, A™ =| —_— = —— |. Therefore, Eq. (4.8) gives 
23 23 23 
7 5 9 
46 23 46 
19 14 8 
23 23 23 
- x a aizia 
aaen ee 7 ļ=ļ|1 
x3 23 23 23 5 1 
7 5 9 
46 23 46 


The solution is xı = 5, x2 = 1, and x3 = 1. [answer] 


(ii) Cramer’s rule. 
The unknowns x1, x2, ..., Xn can be determined from Cramer’s rule: 


Dı D2 Dn 
X1 S5 1X2 Fe Kn 5 


= 4. 
D D D’ a9 
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where D = det(A) is the determinant of the system (given by Section 4.1) and Dx, 
k =1,2,...,n, is the determinant obtained from D by removing the kth row and 
replacing it with the vector b. 

The following four cases can arise. 


Case 1. D Æ 0, b #0. In this case, there will be a unique solution where not all xg 
will be zero. 


Case 2. D Æ 0, b = 0. In this case, the only solution will be x; = 0, x2 = 0, ..., x, =0 
(that is, x = 0). This result is often called the trivial solution. 


Case 3. D = 0, b = 0. In this case, there will be infinitely many solutions other than 
the trivial solution. At least one of the equations can be obtained from the others, that 
is, the equations are linearly dependent. 


Case 4. D = 0, b £0. In this case, infinitely many solutions will exist if and only if 
all of the determinants Dx in Eq. (4.9) are zero. Otherwise there will be no solution. 


Example 4.3.2. Solve the system 


=x] — 2x2 + 4x3=-3, 
2x1 —5x2+ 2x3=7, 
3x, —4x2— 6x3=5. 
Solution. By Cramer’s rule, 
—3 —2 4 -1 -3 4 -1l -—2 -3 
7 —5 2 2 7 2 2 -5 7 
5 -4 -6 3 5 —6 3 —4 5 
x% = D ’ x2>= $ X3 kg D ’ 
-1 —2 4 
where the determinant of the coefficients is |2 —5 2 | = —46 (see Exam- 
3 -4 -6 


ple 4.1.2). Since D Æ 0 and b £0, this indicates that Case 1 applies with a unique 
solution. Consider 


-3 —2 4 

5 2 7 2 7 —5 
7 -5 2 = -3| =| +4] | 
T ene -4 -6 Reel |5 —4 


= (—3) (38) + 2(—52) + 4(—3) = —230. 


Therefore, x; = D,/D = (—230)/(—46) = 5. Evaluation of the other determinants 
yields x2 = 1 and x3 = 1. Hence the final solution is xı = 5, x2 = 1, and x3 = 1 (in 
agreement with Example 4.3.1). [answer] 
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(i) Gauss elimination 


For system Eq. (4.5), described by the vector Eq. (4.7), A = [a jk] is the m x n matrix 


x b 
a a2 >>> Gin l fi 
X2 2 
a a eee a 
Aa| @1 42 2n eer pee ie 
a eee 
mi Am2 amn Xn bm 


The augmented matrix of the system in Eq. (4.5) is defined by 


a > an by 


Aml ++: Amn bm 


In the Gauss elimination method, the augmented matrix is reduced to a “triangular 
form,” from which the values of the unknowns are obtained by “back substitution.” 


Example 4.3.3. Solve the system in Example 4.3.1 by Gauss elimination. 


Solution. Consider the system and its augmented matrix. The equations are given by 


— 2x2 +43 = —3, 
2x1 — 5x2 + 2x3 = 7. 
3x1 — 4x2  — 6x3 = 5. 


The corresponding augmented matrix is 


The three-step process is as follows: 


Step 1. Elimination of x; from the second and third equations. Call the first equation 
above the pivot equation and its x,-term the pivot element (shaded as a circle) in this 


4.3 Linear systems of equations 


step. Use this equation to eliminate x; in the other equations. We have 


=x] —2x2 +4x3 =-3, -1 -2 4 «-3, 
—9x2 4+10x3 =l, 0 —9 10 1, | Row2 + 2Rowl, 
—10x2 +6x3 =-4, 0 —10 6 —4,| Row3 + 3 Row 1. 


Step 2. Elimination of x2 from the third equation. The first equation, which has just 
served as the pivot equation, remains untouched. The next step is to take the second 
(new) equation as the pivot equation. 


—X] —2x9 + 4x3 = -3, 
+ 10x3 = 1, 
— 10x2 + 6x3 = —4. 


The elimination of x2 gives 


—x, —2x2 +4x3 =-3, -l1 -2 4 —3, 
—9x2 +10x3 =1, 0 -9 10 1, 
—46x3 =—46, 0 0 —46 —46,!9 Row 3—10 Row 2. 


Step 3. Back substitution to determine x3, x2, x1. Working backward from the last to 
the first equation of this “triangular” system gives 


—46x3 = —46, x3=1, 
—9x2 4+10x3 =1, x2 = gf 10(1) — 1] =1, 
=x] —2x. +4x3 =-3, x1 =3—2(1) +401) =5. 


The final answer is x; = 5, x2 = 1, and x3 = 1 (in agreement with Examples 4.3.1 
and 4.3.2). [answer] 


Determination of the inverse using Gauss—Jordan elimination 


Gauss-Jordan elimination can be used for practically determining the inverse A~! of 
a nonsingular n x n matrix A. The method is as follows. 


(i) Using A, n systems are formed such that Ax(1) = e(1), ..., AX(n) = e(n) has the jth 
component 1 and the other components 0. Thus, introducing the n x n matrices 
X = [x(1)..-X(n)] and J = [e(1)...e(n)], one can combine the n systems into a single 
matrix equation AX = J and the augmented matrices [Ae(1), ..., A€(n)] into a 
single augmented matrix A=[AI]. Now AX =I implies X = AT! = A~!. To 
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solve AX = I one can apply Gauss elimination to A = [A7] to get [U H], where 
U is upper triangular. 


(ii) The Gauss-Jordan elimination now operates on [U H] and, by eliminating the 
entries in U above the main diagonal, reduces it to [Z K], which is the augmented 
matrix of IX = A~!. Hence, the matrix K = A~! can be immediately read off. 


Example 4.3.4. Find the inverse of A= | 2 —5 2 | by Gauss-Jordan elimi- 
nation. a =e <5 


The inverse of this matrix has been found in Section 4.2. 


Solution. 


(i) Apply the Gauss elimination to the augmented matrix A = [AI]. Then we have 


> 
oS 

II 
N 

l 
Nn 
N 
ee 
= 
© 


o 
l 
O 
— 
© 
N 
— 
==] 


Row 2 + 2. Row 1, 
0 -10 6/3 0 1 Row 3 + 3- Row 1, 


© 
| 
O 
— 
fo) 
NN Fe 


0 0 
1 0 
—10 9 9. Row3 — 10. Row2. 


This augmented matrix [U H] as produced by Gauss elimination agrees with Exam- 
ple 4.3.3. 


(ii) Using additional Gauss-Jordan steps, reduce U to 7. 


ee eee 
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120 —37 20 —-18 
3 B 2B Row 1 + 4- Row 3, 
—9 3 —5 10 
1 =. ee o Row 2 + {| — }- Row 3, 
C1 OD 3 B a (>) om 
—7 5 —9 
1) ee 22.28 
oe 46 23 46 
—19 14 -8 
1 Da aa 
a 23 23 23 
a _9 3 _5 Row 1 — 2-Row2. 
23 23 23 
—7 5 —-9 
||| ae ee 
ae 46 23 46 


The last three columns constitute A~! as per Example 4.2.1. 


(iii) Check the following inverse matrix: 


19 14 8 
3 B 23 
-1 2 4 100 
aal 2 =s oN 3 _5]|=ļ0 1 leaves 
setal a aa soi 
7 5 9 
46 23 4 


4.4 Vector analysis 
This section is based on the work from Towner (2020) and Boas (2006). 


4.4.1 Vectors and fields 

Vectors 

Consider the vectors 4a = (Ax, Ay, Az) and B = (B,, By, Bz) with the given Carte- 
sian components as shown. One can then define the following quantities: 


(a) Scalar product: Z . B = ABcos@ = (A,B, + AyBy + A,B-), where the 


magnitudes of the vectors are given by A = ,/AZ + A} + A? = |A| and B = 
. > > 
| B2 + BS + B? = |B| and @ is the angle between A and B. 


ee —F—TO ee 
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i j k . 
(b) Vector product: Č = Å xB = Ax Ay A,| =(AyBz— AzBy)i+(AzBx— 
B, B, B, 


Ax Bz) j + (AxBy — AyBx)k, where the magnitude |C| = AB sin@ and i, j,k 
are the unit vectors along the x, y, and z axes, respectively. 


A A 


The unit vectors have the orthogonality property: i j = j -k =k-i = 0. The cross- 


A 


~ 2 fF J, a 
products of the unit vectors are i x j =|] 0 O| =&. Similarly, j x k =i and 
O 1 0 


kxi= fe An orthogonal coordinate system is any set of axes for which the unit 


vectors satisfy the orthogonality property. 
Triple products for both scalars and vectors are defined as follows: 


A, Ay A, 
(a) Scalar triple product: A-BxČ= B, By B;|. 
C Cy C 


; > > > > >> > >> 
(b) Vector triple product: A x (B x C)=(A-C)B—-(A-B)C. 
Given the vector A = A, (ft) i+A y(t) j + A(t) k, where only the components 


are functions of time f, the time derivative of A is 


> 

dA dA,» dA, dA; >» 
= k. 

dt dt ne dt it dt 


The normal rules of differentiation apply: 


cd ey ey Oe 
dt dt dt 
d ~> = JÅ > > dB 
—(A.B)=—. Fray eee 
dt dt dt 
d dA dB 
—(AxB)=—— xB +4 — 
dt dt d 
For the last relation, the order must be respected since A x Fi = -É x Z. 
Fields 


A field is defined as a physical quantity, which has different values at different 
points in space, such as: (i) the temperature variation in a room, (ii) the electric field 
around a point charge, or (iii) the gravitational force experienced at or near the earth. 
The word “field” encompasses both the physical quantity and the region of inter- 
est. A scalar field results when the physical quantity is a scalar (such as temperature 


ee eee 
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(x,y,z) 


© 
(Xo,YoZo) 
FIGURE 4.1 


Schematic for the derivation of the directional derivative. 


T (x, y, z)). A vector field results when the physical quantity is a vector (such as the 
electric field E (x, y, z) or the gravitational force F (x, y, Z)). 


A directional derivative is a change in a physical quantity depending on which 
direction one moves. For instance, consider the scalar field ¢ (x, y, z). One wishes to 
find (d@/ds) at a specific point (xo, Yo, Zo) and in a given direction. Moreover, let 
û be a unit vector in the given direction, where # = ai + bj + ck, in which || = 
Va? + b? + c? = 1. Starting from the point (xo, Yo, Zo) in Fig. 4.1 and going the 
distance s to reach the point (x, y, z), one has (x, y, z) — (xo, Yo, Zo) = Us = (ai + 
bj + ch) s. The respective components are (x — xo) = as, (y — yo) = bs, and (z — 
Zo) = cs. The directional derivative is evaluated from 


dp odddx dddy dddz dd P ad ad 
= = a : 
ds oxds dyds dzds dx dy Oz 


ae dp \ + a A 
This result is the dot product of the vector u with the vector (3) i+ (2) j+ 


Ox 
(=) 
az 


dp\ _. . 
(2) =V¢-ii. (4.10) 


The quantity V¢ is called the gradient of ¢ (or grad @), 


_ (2862, (ab) >, (ab) > 
“a (o) a & a (5) Sii 


and V can be thought of as a vector operator: 


oi... fava. FINA 
v=(S)i+(S)i+(S)é (4.12) 


The gradient operator can be defined for different coordinate systems (see Spiegel 
(1973)). 


eC 
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An equipotential surface is one in which the scalar field is constant on this surface, 
that is, (x, y, x) = constant. Using Eq. (4.10), we therefore have 


(3) =v0-a=o. 


ds 


Since the vector products of V@ and ù are zero, these vectors are orthogonal. Since 
ù is a tangent to the equipotential surface, V@ is perpendicular (or normal) to this 
surface. = 

Consider the vector field V (x, y, z) with components Vy, Vy, and Vz, each of 
which is a function of position: 


= ` ` é 
V = V(x, y, z)i + Vy(x, y, Z)j + Vk(x, y, Zk. 


A scalar and vector product can be subsequently formed, respectively, with the vector 


operator V: 
> —- ƏV aV OV 
V-V =div V = + + 
Ox dy Oz 
and 
i j k 
Vx Vv = curl y =I ? ð ð : 
ax dy Oz 
x Vy Vz 
Moreover, the scalar product V with the vector V@ yields the so-called Laplacian 
operator: 
é ð (ð$ 0 (0¢ 0 (0¢ 
2 
= V . Vọ = div grad ġ = 
ve j grad $ TRE 
dg? IP 3p? 
=at 1 ao" 
ox dy Oz 


Example 4.4.1. Prove the vector identity V x V@ = 0. 


Solution. We have 


mp ah); 
= | — — — |z 
dydz dzdy 


ee eee 
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Example 4.4.2. The work done dW in displacing a particle an infinitesimal distance 
dr by a force F is dW = f F . dr. If the particle moves a macroscopic distance 
from point A to B along a path subject to the force F, the total work done W is W = 
f vi È dr Moreover, if V x F = 0, then the force È is termed a conservative force. 


(a) Show that if the force is defined by a scalar potential field such that F= —Vọ, 
then the force is conservative. 
(b) Show that for a conservative force, the line integral is independent of the path. 


Solution. 


(a) Substituting in F = —V¢ gives V x F = —V x V¢ = 0 from the vector identity 
in Example 4.4.1. [answer] 
Bs. B > B 
(b) We have W = A F -dr = — j Vo-dr=— j do = — (¢(B) — ¢ (A)) = 
($ (B) — ¢(A)), as follows from the definition of V@ in Eq. (4.11) and dr = 
idx + jdy + kdz. [answer] 
Choosing A as a reference point (say, at the origin A = (0, 0, 0)) and B as a general 


B> > 
point B(x, y, z), one obtains (x, y, z) — ¢ (0, 0, 0) = — F - dr. Hence, witha 
conservative force, one can find the scalar potential field (up to an arbitrary constant) 
by simply evaluating the line integral over any path that makes the integration easy. 


4.4.2 Integral theorems 


Integral theorems are of importance in electrical engineering, physics, and fluid flow 
problems. 


Green’s theorem 


Consider two well-behaved functions, P(x, y) and Q(x, y), which are continuous 
and differentiable, as defined on the xy plane. The Green’s theorem in two dimen- 


sions states 
ð ə P 
ff (so -Z)aas= fe dx + Q dy), (4.13) 
A\ 0x dy C 


where A is the area bounded by the contour C. As shown in the following example, 
the symbol $ indicates a line integral around a closed curve (that is, ending at the 
starting point), where the direction of integration is taken to be counterclockwise. 


Example 4.4.3. Prove the Green’s theorem given in Eq. (4.13) for two dimensions 
(see Fig. 4.2). 


Solution. 


EEE 
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(a) yulx) (b) 
A 


a i x b x 
FIGURE 4.2 


Schematic for the derivation of the Green’s theorem in two dimensions with integration (a) 
over the two y paths and (b) over the two x paths. 


oP ’ bot , 
Consider the area integral of = where the integration is carried out over y first 


y 
and then over x, as in Fig. 4.2(a), yielding 


yu 3P b yu 
tT izdy=- li ra dx dP 
YL a YL 


z “i [P(x, yy) — P(x, yx) ax 


a 


b a 
f P(x, yt) dx + f P(x, yu) dx 
a b 


— amm — amm 
line integral from 1 to 2 by lower path line integral from 2 to 1 by upper path 


=f P dx. 
C 


Similarly, now considering the area integral of =. with integration first over x and 
x 


then over y, as in Fig. 4.2(b), gives 


J| Sava [va yf arm f a [a0 


f [O(xr, y) — QL, y)ldy 


c d 
Í Q (xL, y)dy F f Q (xr, y)dy 


a ee 
line integral from 3 to 4 by left path line integral from 4 to 3 by right path 


=$ Ody. 


ee eee 
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Hence, adding these two results yields Eq. (4.13). [answer] 


Using the Green’s theorem one can either evaluate a line integral around a closed 
path or a double integral over the area enclosed, whichever is easier to perform. The 
divergence theorem and Stoke’s theorem follow from the Green’s theorem. Here one 


considers P(x, y) and Q(x, y) as two components of a vector field y defined in a 


plane with V = V(x, y)i + Vy (x, y)j and V, = 0. These theorems can be easily 
generalized to three dimensions. The derivation of these theorems are given in the 
following sections. 


Divergence theorem 
Choosing Q = V, and P = — V; in Eq. (4.13) yields 


dV, 3V; 
I Ti 2) dxdy= É (V, dx + Ve dy) 
A Ox dy C ` 


Using the definition for the divergence of Vv in Section 4.4.1, 


I divV dxdy= Q (Wii + Vy j)-Gdy — j dx). (4.14) 
A C 


From Fig. 4.3, let dr be an infinitesimal element of the bounding contour C which 
encloses the area A. Here the tangent dr = idx + jd y. If 7 is the outward normal 
=> A A 
perpendicular to dr, then i = i dy — j dx from Fig. 4.3. Since ñ is a unit vector, it is 
idy— jd 
normalized, ñ = A where ds = y dx? + dy?. Hence, 
s 


ids = Ê dy — j dx). (4.15) 


Using the definition of the div V from Section 4.4.1 and substituting Eq. (4.15) into 
Eq. (4.14) yields the divergence theorem in two dimensions: 


I V-Vadrdy= V «fads. 
A C 


This result can be generalized to three dimensions for a volume V contained 
within a bounding surface S, where do is an area element on the bounding surface 
and dV is a volume element (that is, dV = dx dy dz), such that 


=> > 
JJ 9 Vav=Qh¥ ñas. (4.16) 
V S 


The divergence theorem is used in the analysis of Example 5.1.1, Section 7.2.2. 


ee 
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FIGURE 4.3 


Schematic of the tangent and outward normal for the curve C. 


Stoke’s theorem 
Alternatively, choosing Q = Vy and P = V; in Eq. (4.13) gives 


aVy ƏV 
I aee +) dxdy= Vs dx + V, dy), 
A \ Ox oy C 


Using the definition for the curl of Vv in Section 4.4.1, 


ok ws ee, fe 4 
Tf curl V Rdxdy=$ (vii +Vy J): (dx + jdy). (4.17) 
A C 


Thus, substituting the definition of curl V from Section 4.4.1 gives Stoke’s theorem 


in two dimensions: 
> a > > 
ffo V)-kaxay= V -dr, 
A C 


where dr is the tangent vector as previously shown over the bounding contour. This 
result can also be similarly generalized to three dimensions, where the area A instead 
of being confined to the xy plane is an open surface o and the contour C is the curve 
bounding the surface, as shown in Fig. 4.4. Thus, from Fig. 4.4, if ñ is the unit vector 
normal to the surface, the final result is 


> n > —> 
[fox V)-ido= $ V-T. (4.18) 


C 


Example 4.4.4. Stoke’s theorem is important in magnetism. Using Ampere’s Law, 
> > > 
fe H -dr = I, derive Maxwell ’s equation, V x H = J, using Stoke’s theorem. 


Here H = B / os B is the magnetic field, 4o is the permeability, and Z is the total 
current crossing any surface bounded by C. The integral C is taken around a closed 
contour that bounds an open surface S and J is the current crossing a perpendicular 
area. 


UU  ( eee 
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FIGURE 4.4 


Schematic showing an outward normal on an open surface o with a contour C. 


Solution. 
The quantity J -/do is the amount of current that crosses a surface element do. 


Therefore, the total current over a surface S bounded by a contour C is ff; T -fido. 


Hence, by Ampere’s Law, 
> —> >, 
$ -d= ff J -ndo. 
c S 


Using Stoke’s theorem in Eq. (4.18), 


> => > , >, 
f H-dr= f| vx âdo= [fF -ao 
C o S 


Since this result is valid for any surface S bounded by C, the result is as follows: 


> > 
VxH=J. [answer] 


Problems 

4.1 Given the following system of equations: 
3y+2x=z+1, 
3x +2z = 8 — 5y, 
3z— 1 =x —2y, 


solve for x, y, and z using the following methods: 
(a) Inverse method; 
(b) Cramer’s rule; 
(c) Gauss elimination; 
(d) Maple software package. 


EOC 
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4.2 Find the directional derivative of ¢ = x*y + xz at (1,2, —1) in the direction 
=> 
A Bo A 
Aa = 2i — 2j +k. Hint: Determine the vector û = TAT’ find V@ at (1,2, —1), and 
do 
then calculate T; from Eq. (4.10). 
s 


4.3 What is the work done if F= xyi — yj and dr = idx + jdy assuming the 
path is: 

(a) a straight line where y = 5x from A = (0,0) to B = (2, 1)? 

(b) parabolic where y = gx from A = (0,0) to B = (2, 1)? 

(c) Comment if the evaluated work is the same or different in (a) and (b). 

4.4 Let the force F = (2xy —2)i + x2 f — Gx + DÊ. 

(a) Show that F is conservative. 

(b) Find ¢(x, y, z) — ¢(0, 0, 0). Note that from (a), since the force is conservative, 
any path can be taken to evaluate the line integral. Therefore, use simple straight 
line paths along the same directions as the axes starting from the origin (0, 0, 0) to 
(x, 0,0), then from (x, 0,0) to (x, y,0), and finally up to the final endpoint from 
(x, y, 0) to (x, y, z). 


CHAPTER 


Partial differential 
equations 


Partial differential equations involve functions of several independent variables that 
arise routinely in nature. They describe physical laws in many areas, including fluid 
mechanics, heat and mass transfer, propagation of sound, elasticity, electrostatics and 
electrodynamics (for example, Maxwell’s equations), and relativity, to name a few. 

The derivation of important partial differential equations that arise in engineer- 
ing is presented. Important analytical methods, including separation of variables and 
transform techniques, as employed for solution of various types of engineering prob- 
lems that arise in heat transfer, fluid flow, and diffusion theory, are discussed. 


Definitions 


A partial differential equation is an equation containing an unknown function of two 
or more variables and its partial derivatives with respect to these variables. 


The order of a partial differential equations is that of the highest-order derivatives. 
2 


For example, = = 2x — y is a partial differential equation of order 2. 
x 


A solution of a partial differential equation is any function that satisfies the equation 
identically. 


A general solution is a solution that contains a number of arbitrary independent 
functions equal to the order of the equation. A particular solution is one that is ob- 


tained from the general solution by a particular choice of arbitrary functions. For 
2 


example, the general solution of the partial differential equation = 2x —yis 


xdy 
u = x?°y — 4xy? + F(x) + GQ). With F(x) = 2sinx and G(y) = 3y* — 5, the par- 
ticular solution is u = x? y — xy? + 2sinx + 3y* —5. 


A singular solution is one that cannot be obtained from the general solution by a 
particular choice of arbitrary functions. 


A boundary value problem involving a partial differential equation seeks all solutions 
of a partial differential equation which satisfy conditions called boundary conditions. 


tases ; : ðu ; 
When time is one of the variables, the solution u (or — or both) must satisfy 
initial conditions at t = 0. 


Advanced Mathematics for Engineering Students. https://doi.org/10.1016/B978-0-12-823681-9.00013-7 1 3 1 
Copyright © 2022 Elsevier Inc. All rights reserved. 


M UU 


132 CHAPTER 5 Partial differential equations 


Linear partial differential equations 
The general linear partial differential equation of order 2 in two independent variables 
has the form 
3u +B 3u so“ Dp? 
ax? dxdy ay a 


u ðu 
+E— + Fu=G, (5.1) 
x dy 


where A, B, C, D, E, F, and G may depend on x and y (but not u). If a second-order 
equation with independent variables x and y does not have the form of Eq. (5.1), it is 
nonlinear. If G = 0, the equation is homogeneous; otherwise it is nonhomogeneous. 
Generalizations to higher-order equations are easily made. 


Because of the nature of the solutions of Eq. (5.1), the equation is 


(i) elliptic if B? — 4AC <0, 
(ii) hyperbolic if B? — 4AC > 0, 
(iii) parabolic if B? — 4AC =0. 


5.1 Derivation of important partial differential equations 
Heat conduction and diffusion equation 
The heat conduction equation is given by 


3 
s = KV?u. (5.2) 


Here u(x, y, z, t) is the temperature of a solid at position (x, y, z) at time t. The con- 
stant, x, called the thermal diffusivity, equals K /(o p), where the thermal conductivity 
K, the specific heat o, and the density p are assumed constant. 


. l ; ðu 
The one-dimensional heat conduction equation reduces to F aa 
x 


Example 5.1.1. (Derivation of the heat conduction equation) 


Solution. 
Let V be an arbitrary volume lying within a solid and S denote its surface. Using the 
Fourier law for the heat flux, 


J= —KVu, 


the total quantity of heat leaving S per unit time is 


II (-KWVu) -aas, 


re = = = — 
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where ñ is the normal to the surface S. Thus, the quantity of heat entering S per unit 
time using the divergence theorem in Eq. (4.16) is 


J|) -aas= fff > (rè) p a 


The heat contained in a volume V is given by 


Jff, oovav. 


Then the rate of increase of heat is 


J dV = ON gy 5.4 
a Mheer = [ffoi 64 


Equating Eq. (5.3) and Eq. (5.4), 


Jff g= Jfa KŪu) dv 
Ii, leone -0 (x)| a =0. 


Since V is arbitrary, the integrand (assumed continuous) must be identically zero so 


or 


that 
M Sọ. (Gu) 
aa u 
ý ot 
or, if K, o, and p are constants 
ðu K>- 2 
— = —V -Vu =k V^u 
ot op 


ðu : 
Moreover, for steady-state heat flow (such that an = 0), the equation reduces to 


Laplace’s equation 


V-u=0. 
Example 5.1.2. (Derivation of the diffusion equation) 
Solution. 


Consider the volume element in Fig. 5.1 over which the mass balance is made, where 
Fx is the rate of transfer in the x direction through a unit area. 


s 
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Control volume 


OF. 
F,dydz [Fe + 5, dxldydz 


FIGURE 5.1 


Volume element for derivation of the diffusion equation. 


In the x direction, the net mass flow across the opposing faces is 


OF, 
Input — Output = E — (A + 5 Z ax) dy dz, (5.5) 
x 
and similar expressions hold for the other two directions. In diffusion theory, 
the rate of transfer of a diffusing substance through a unit area is given by 
Fick’s first law of diffusion: 


3 aC 
F=—-DVC (tor example, Fy = -D4£) ; (5.6) 
x 


where C is the concentration of the diffusing substance and D is a diffusion coeffi- 
cient (cm? s~!). The accumulation rate in the volume element is given by 


aC 
Accumulation rate = oo dy dz. (5.7) 


Equating Eq. (5.5) and Eq. (5.7) for all directions and dividing by dx dy dz yields 


ac OF, OF, OF, 
= - £]. 5.8 
at (++) 8) 
Using Eq. (5.6), one obtains 
ac 
— = pve (5.9) 
ot 


if D is constant. This equation is known as the diffusion equation (Fick’s second law) 
and has the same form as the heat conduction equation in Eq. (5.2). 


5.1 Derivation of important partial differential equations 


Vibrating string equation 
The one-dimensional wave equation is given by 


a7u a7u 


This equation is applicable to the small transverse vibrations of a taut, flexible string 
(for example, a violin string), initially located on the x axis and set into motion (see 
Fig. 5.2). 


u(x,t) 


FIGURE 5.2 


Displacement of a string. 


The function u(x,t) is the displacement at any point x of the string at time t. The 
constant c? = T/p, where T is the (constant) tension of the string and p is the (con- 
stant) mass per unit length of the string. It is assumed that no external forces act on 
the string but that it vibrates only due to its elasticity. 


The equation can be generalized to higher dimensions, for example vibrations of a 
membrane (drum head) in two dimensions: 


3?u 2 3u 3u 
= ; 5.11 
a (= u a) ri 
Laplace’s equation 
Laplace’s equation is given by 
V?u =0. (5.12) 


This equation occurs in many fields. In the theory of heat conduction, u is the 
steady-state temperature. In the theory of gravitation or electricity, u represents 
the gravitational or electric potential, respectively. Hence, this equation is called the 
potential equation. 


Poisson’s equation 

Possion’s equation is an extension of Laplace’s equation. For example, Newton’s law 
of gravity is described by V7 = 4 Gpo, where ¢ is the gravitational potential, G is 
the gravitational constant, and pọ is the density of the matter field. 
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5.2 Analytical methods of solution 


There are many methods by which boundary (and initial) value problems involving 
linear partial differential equations can be solved. The following are among the most 
important ones. 


5.2.1 General solutions 


The general solution is first found and then the particular solution which satisfies 
the boundary (and initial) conditions. The following theorems are of fundamental 
importance. 


Superposition principle 

If u1, u2, ..., Un are solutions of a linear homogeneous partial differential equation, 
then c1u1 + c2u2 +... + CnUn is also a solution to the general solution of the ho- 
mogeneous equation, where c1, C2, ..., Cn are arbitrary constants. Sometimes, general 
solutions can be found by using methods of ordinary differential equations. 


General solution of linear nonhomogeneous partial differential equations 
A general solution of a linear nonhomogeneous partial differential equation can be 
obtained by adding a particular solution of the nonhomogeneous equation to the gen- 
eral solution of the homogeneous equation. 


32 
Example 5.2.1. Solve 7 = x? y with boundary conditions z(x, 0) = x?, z(1, y) = 
x 
cos y. 4 
Solution. 5 
Given — see x?y and integrating with respect to x, 
ox \dy 


əz 1 3 
—=- F(y). 
a a y +F) 


Integrating with respect to y yields 
l 32 
z= a y+ | FO)dy + G(x). 
Hence, the general solution is 
l 32 
z, y) = 75 y +H O) + GO), 


where H (y) and G(x) are arbitrary (essential) functions. Using the boundary con- 
ditions z(x, 0) = x? = H(0) + G(x) implies G(x) = x? — H (0). Therefore, z = 


1 
ary + H(y) +x? — H(0). 


el eee 
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1 
Since z(1, y) = cos y, we have cos y = a + H(y)+ 1 — AH(0). Hence, H(y) = 


1 
cos y — a= 1+ H(0). 


1 1 
The particular solution is finally z = zo + cos y — ay +x?—1. [answer] 


If A, B,..., F are constants and G = 0 in the general linear partial differential 
equation of Eq. (5.1), the general solution of the homogeneous equation can be found 
by assuming 

“= earthy 


where a and b are constants to be determined. 


a? a? 3? 
"ypg pa o, 
ax axdy dy 


Example 5.2.2. Find solutions of 


Solution. 
Assume u = e+, Therefore substituting this expression into the partial differential 
equation gives 


(@? +3ab+ 28?) e™tbhy —0 or (a +3ab + 22) =0. 


Thus, (a + b)(a + 2b) = 0, which implies a = —b and a = —2b. If a = —b, then 
eTbx+by — e69-*) is a solution for any value of b. Likewise, if a = —2b, then 
e?0—2*) is also a solution for any value of b. Since the equation is linear and ho- 
mogeneous, sums of the solutions are solutions. For example, 3e?07® — 2630-) 4 
5e™-*) is a solution (among others). In fact, F(y — x), where F is arbitrary is a 
solution. Similarly, G(y — 2x) is a solution where G is arbitrary. 


The general solution is u = F (y — x) + G(y — 2x). [answer] 


5.2.2 Separation of variables 


In the separation of variables method, a solution can be expressed as a product of 
unknown functions, each of which depends on only one of the independent variables. 
The resulting equation must be separable, so that one side depends only on one vari- 
able while the other side depends on the remaining variables so that each side must be 
equal to a constant. By repetition of this method, the unknown functions can be deter- 
mined. Superposition of these solutions lead to the actual solution. This method often 
makes use of Fourier series, Fourier integrals, Bessel series, and Legendre series. 


Example 5.2.3. Vibrating string (wave equation) [solution by Fourier series]. 


A string of length L is stretched between points (0, 0) and (L, 0) on the x axis (see 
Fig. 5.3). At time ¢ = 0, it has a shape given by f(x), 0 < x < L, and it is released 
with an initial velocity g(x). Find the displacement of the string at any time later. 


Me 
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FIGURE 5.3 
Vibration of a string of length L. 


The equation of the vibrating string is 


3?u o2 3?u 


ae S ae O0<x<L,t>0, (5.13) 


where u(x,t) is the displacement from the x axis at time f. Since the ends of the 
string are fixed at x = 0 and x = L, 


u(0O,t)=u(L,t)=0, t>0. (5.14) 
The initial shape of the string is given by f(x): 

u(x,0)= f(x), O<x<L, (5.15) 
and the initial velocity is given by g(x): 


ə 
W Sua, =g) O<x<L. (5.16) 
at t=0 


Solution. 
To solve this boundary value problem, let u = X (x)T (t). Substituting this relation 
into the partial differential equation in Eq. (5.13) gives 


Tl xX’ 
XT" =CX"T o ==. 
eT X 
Letting the separation constant equal —A?, one has 

T"+X PT =0, X"+1?X=0. 


Note that choosing the separation constant equal to A? or zero leads to a trivial solu- 
tion X = 0 following application of the boundary conditions. Thus, 


T = A,sinAct + Bı cosàct, X = AzsinAx + Bocosax. 


el Eee 
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A solution is given by 
u(x,t) = XT = (A2sindx + B2 cos àx) (Aq sin àct + Bı cosAct). 
The boundary condition u(0, t) = 0 implies B2 = 0. Therefore, 
u(x,t) =sindx (Aq sin àct + Bı cosAct). 


From the boundary condition u(L,t) = 0, one therefore has sinAL (A, sinàct + 
T 
Bı cosàct) = 0 so that sinà L = 0, AL = nxn or à = a (n an integer) since the 


second factor must not be equal to zero. 


Thus, there are infinitely many solutions of Eq. (5.13) satisfying the boundary condi- 
tions in Eq. (5.14): 


: . NTX 
Un(X,t) = (An sin ànt + Bn cos ànt) sin Fz 


where A, = = (n = 1, 2...). Since Eq. (5.13) is linear and homogeneous, it fol- 

lows from the superposition principle that a solution to Eq. (5.13) is the infinite series 
= = nx 

u(x,t) = > Un(X,t)= = (An SiN Ant + Bn COSAnt) sin “ao (5.17) 


n=1 n=1 


Here to satisfy the initial conditions in Eq. (5.15) and Eq. (5.16), it is neces- 

sary to superimpose solutions. Note that u, are called the eigenfunctions and Àn 

the eigenvalues of the vibrating string. Using the initial condition in Eq. (5.15), 
CO 


_ NTX ee ee 
u(x, 0) = 5 B, sin ES = f(x), and from the theory of the Fourier sine series in 


n=1 


Section 2.5.6, 


2 eh | NTX 
B= = | f (x) sin — dx (n =1,2,...). (5.18) 
L Jo L 


Using the initial condition in Eq. (5.16), 


CO 
: . NTX 
= > (Anàn COS Ànt — Bnàn sin ànt) sin —— 
—0 L 
i n=1 1=0 
[0.6] 
. AX 
= > AndAn sin — = g(x), 
L 

n=1 
and using the theory of the Fourier sine series, 
L 


2 . NTX 
An = — g(x) sin — dx (n= 1,2,...). (5.19) 
cn Jo L 


OOOO. 
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Thus, Eq. (5.17) to Eq. (5.19) constitute a solution of Eq. (5.13) subject to conditions 
Eq. (5.14) to Eq. (5.16). [answer] 


Example 5.2.4. Heat conduction equation [solution by the Fourier series]. 


Find the temperature in a bar in which the ends of the bar are kept at zero temperature 
and the initial temperature in the bar is f(x) (see Fig. 5.4). 


0 x=L 
FIGURE 5.4 


Heat conduction in a bar of length L. 


The heat conduction equation for a temperature u(x, t) is 


ðu 3?u 
— =k, 0<x<L,t>0, (5.20) 
ot 3x? 
which is subject to boundary conditions 
u(0,t)=u(L,t)=0,t>0 (5.21) 
and initial condition 
u(x, 0)= f(x), 0<x< L. (5.22) 


Solution. 
To solve this boundary value problem, let u = X (x)T (t). Substituting this relation 
into the partial differential equation in Eq. (5.20) gives 

y" T” 

XT'=KX"T or — = —. 

X KT 
Each side must be equal to a constant —A? (note that choosing the separation constant 
equal to 47, the resulting solution does not satisfy the boundedness condition for real 
values of à) so that 


X"+ X=0, T4500. 
with solutions 
X =AjcosAx+ Bı sinàx, T= ce", 


A solution of the partial differential equation is thus given by 
u(x,t) = XT =(Acosax + B sin àx) e7". 
The boundary condition u (0, t) = Ae“ E) implies A = 0. Therefore, 


E : —Kì?t 
u(x,t) = B sinàxe ; 


el eee 
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Using the second boundary condition, u(L, t) = BsinAL eet = 0, If B= 0, the 
solution is identically zero, so one must choose sinAL = 0 or AL = nm, that is, A = 


T 
= (n = 1,2, ...). Hence, the eigenfunctions 


LATEX e2 
un (x, t) = By sin —— € n! è (n=l1,2...) 


are solutions of the heat conduction equation, with corresponding eigenvalues Àn = 
ni 


L 


To satisfy the initial condition, it is necessary to superimpose an infinite number 
of solutions: 


Co [0,6] 
| AIX 32 nit 
u(x,t) = Yo un, =Y Bn sine Raat (an = a) (5.23) 
n=l n=1 
From Eq. (5.23) and the initial condition, 
= nx 
u(x,0)= È` Bn sin F = fe). 
n=1 
The B,, coefficients are obtained from the Fourier sine series, 
2 fh | NTX 
B= > Ff (x) sin — dx (n=1,2,...). (5.24) 
L Jo L 


Thus, Eq. (5.23) and Eq. (5.24) constitute a solution of the problem. [answer] 


Example 5.2.5. Heat conduction equation [sinusoidal initial temperature]. 


Find the temperature u(x, t) in a lateral insulated copper bar 80 cm long if the initial 


temperature is 100 sin (=) °C and the ends are kept at 0°C. Physical data for copper 


i 
include: density of 8.92 = specific heat of 0.092 T and thermal conductivity 
cm g° 


cal 
of 0.95 


cm. s°C" 
Solution. 


The initial condition gives u(x,0) = vB sin Z> = f(x) = 100 sin(=) so 
g > = n 80 =. = 80 


n=1 
2 


K 
that Bı = 100, By = B3 =... = 0. In Eq. (5.23), A2 = — and x = — =0.95/ 
L2 op 


OOOO 
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2 
(0.092 x 8.92) = 1.158 ——. Hence, «A? = 1.158(s)?/80? = 0.001785. Thus, the 
S 


X —0.001785t 


ee T ae 
solution is u(x, t) = 100 sin 30 e , Where ¢ is in seconds. [answer] 


Example 5.2.6. Laplace’s equation [two-dimensional heat flow in steady state]. 


ðu 
If the heat flow is in steady state such that — = 0, the heat conduction equation 


reduces to the Laplace equation (in two dimensions): 


3u 8u 


Vu = 4 
“ TTE 


=0. (5.25) 


This equation must be solved for a given boundary condition on the boundary curve 
C for some region of the xy plane. This boundary value problem is called a: 


(i) Dirichlet problem if u is prescribed on C; 


m ð 
(ii) Neumann problem if un = = (that is, the normal derivative) is prescribed on C; 


x 
(iii) Mixed problem if u is prescribed on a portion of C and un on the rest of C. 


u=f(x) 


FIGURE 5.5 


Dirichlet problem for heat flow in a plate. 


For example, consider the Dirichlet problem in Fig. 5.5 for the rectangle R. The heat 
conduction equation in Eq. (5.25) is subject to the boundary conditions 


u(0, y) =u(a, y) =0 (5.26) 


and 
u(x,0) =0, u(x, b) = f(x). (5.27) 


Solution. 
Using the separation of variables, let u = X (x)Y (y). Substituting this relation into 
Eq. (5.25) yields 

xX” y" 


X"Y + XY" =0 or z= 7 zoe 


ee eee 
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Thus, X” + à? X =0, with X(0) = 0, X (a) = 0 (see Eq. (5.26)). This ordinary dif- 


nt 
ferential equation gives à = (=) and the corresponding nonzero solutions 
a 


_ NT 
X (x) = Xn (x) = sin — x (n =1,2...). 
a 
Similarly, the equation for Y becomes 
n a 2 nw \2 
Y” — 2Y =0, = (=) l 


with the solution 

Y(y) =Vi(y) = Aner + Bre. 
The boundary condition in Eq. (5.27) gives Y(O) = A, + Bn = 0, which implies 
Bn = — An. Therefore, the solution can be written as 


Y(y)=A* sinh” = (AX =2A,). 
a 


A solution of the partial differential equation is thus given by 
nex ny 
u(x, y) = Až sin —— sinh —. 
a a 
For the boundary condition u(x, b) = f (x), one can consider the infinite series 


[0,0] 


u(x, y) = J un, y). 


n=1 


= yoa UAD\N | nx : ar . 
Therefore, u(x, b) = >. A; sinh —— } sin — = f(x). Again, this is a Fourier 
a a 


n=1 


b 
sine series (with Fourier coefficients b, = A} sinh S Therefore, b, = 
a 
Ay sinn 7? = = F fœ) sin“ dx. 
The solution of the problem is 
ny 
= A% sin“ sin nh —, 5.28 
u(x, y) = as : (5.28) 
where 
* 
A; = a =h f(x) sin — * dx. [answer] (5.29) 
a sinh —— 


OOOO. 
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Example 5.2.7. Cylindrical heat conduction equation [solution by the Fourier— 
Bessel series]. 


An infinitely long cylinder of unit radius has a constant initial temperature uo. At 
time tf = 0 a temperature of 0°C is applied to the surface and is maintained. Find the 
temperature at any point of the cylinder at any later time rf (see Fig. 5.6). 


FIGURE 5.6 


Heat conduction in an infinitely long cylinder. 


For an infinitely long cylinder, the heat conduction equation is independent of z 
and ¢ and reduces to 


du _ & 1 du 


ie 572 ra), 0<r<1l,t>0, (5.30) 


subject to the boundary conditions 


u(l,t)=0, |u(r,t|<M (5.31) 
and initial condition 
u(r, 0) = uo. (5.32) 
The second boundary condition in Eq. (5.31) is a reflection that the temperature must 
be bounded. 
Solution. 


Using separation of variables, u = R(r)T (t), one obtains 


"i TOS: ieee T R” IR 42 
RT’ =kx|TR'+—R or = + =-)*. 
r KT R r R 


Thus, 
PR +rR +PVR=0, T+T =0, 


ee eee 
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with solutions as shown in Section 2.5.3 for the Bessel equation: 
R=AjJo(Ar) + BiYo(ar), T =c", 
A solution of the partial differential equation is thus given by 
u(r, t) = (AJo(Ar) + BYo(Ar)) e“. 


Since Yo(Ar) is unbounded as r approaches 0 (see Fig. 2.4) and |u(r, t)| must be 
bounded in Eq. (5.31), this condition implies B = 0. Therefore, 


u(r, t) = Ah Ar) e. 


Using the boundary condition u(1,t) = AJo(A) enki = 0, we obtain Jo(A) = 0, 
which has infinitely many zeros for A = àn (n = 1, 2, ...). Hence, the eigenfunctions 
are 


un(r,t) = AnJonre™ (n=1,2,...), 


with corresponding eigenvalues 4,,. To satisfy the initial condition in Eq. (5.32), it is 
necessary to superimpose an infinite number of solutions: 


(oe) 


[0,6] 
u(r, t) = X unr, t) = Ý Ano Anr) ea, 


n=1 n=1 


From the initial condition in Eq. (5.32): 


CO 
u(r,0) =) > AnJo(nr) = Uo. 


n=1 


Hence, this is a Fourier—Bessel series (see Example 2.5.12), and the coefficients Ay, 
are given by 


2 1 
= ira rf (r)Jo(Anr) dr 


=a ama fo EJo(E) dé. 


The second equality arises with f(r) = uo. Using the recursive formula for Bessel 


functions (see Section 2.5.3), ae [x"Jy(x)] = x”Jp—1 (x), or equivalently (with v = 1) 
x 


the relation f xJo(x) dx = xJı (x); therefore, A, becomes 


Quo hindi On) = Quo 
Wop O daJ An) 


n= 


re 
146 CHAPTER 5 Partial differential equations 


The solution of the problem is therefore 


[0,6] 


JoAnr) ext 
u(r, t) = 2uo 2 E e“ n, [answer] 


The above solution is identical to that given in Example 5.2.12 using Laplace trans- 
forms and the complex inversion formula. 


Example 5.2.8. Heat conduction equation [solution by Fourier integrals]. 


Consider the problem of Example 5.2.4 for heat conduction in a bar, but in this case 
the bar is semiinfinite in length. In this analysis, the Fourier series can be replaced by 
Fourier integrals (see Section 3.2.1). 


A semiinfinite bar (x > 0) whose surface is insulated has an initial temperature 
f(x). A temperature of zero is suddenly applied to the end x = 0 and maintained. 
Find the temperature u(x, t) at any point x at time t. The heat conduction equation is 


ðu 3u 
— =Kk—, x>0,r>0, (5.33) 
ot ax? 
with boundary conditions 
u(0, t) = 0, lju(x,t)|< M. (5.34) 
The initial condition is 
u(x,0) = f(x). (5.35) 


Solution. 
By separation of variables (see Example 5.2.4), 


u(x,t) = XT =(Acosix + B sin àx) e7". 
Using the first boundary condition in Eq. (5.34), A = 0 so that 
u(x, t) = B sinàx e", (5.36) 


Since there is no restriction on à one can replace B in Eq. (5.36) by a function B (à) 
and still have a solution. Furthermore, one can integrate over à from 0 to oo and 
still have a solution. This approach is the analogy of the superposition theorem for 
discrete values of à used in connection with a Fourier series. Thus, a possible solution 
is 


oo 7 
u(x,t) = f B(A) sinxe ^" dÀ. (5.37) 
0 


ee eee 
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Using the initial condition in Eq. (5.35), 
CO 
“œ= | B(à)sinàx dà = f (x). (5.38) 
0 
Recalling Fourier’s integral theorem in Section 3.2.1, 
[0,0] 
fx) a| [A (à) cos àx + B(à)sinàx] dr, (5.39) 
0 


where 


A(A) =f" f(x) cosax dx, 


1 re (5.40) 
BA) =: f f(x)sinàx dx. 
T J—oo 
Now since the function f(x) in Eq. (5.38) must be odd, A (à) = 0 and B(A) is 
2 f% 2 [° 
B(à) = =f f(x)sinàx dx = zf f(v)sinàvdv. 
T JO T JO 
Substituting this expression in Eq. (5.37) yields 
2 [9 ¢% 2 
u(x,t) = f Í fe sin àv sinax dadv. (5.41) 
T Jo JO 


The equation sin Àv sin àx = 5[cos A(v — x) — cos à (v + x)] implies 


u(x,t) = Ji [ fvye [eos A(v — x) — cosà (v + x)] dà dv 


1 fe Oe, secre 
=: f(v) p e t cosà (v — x)dàÀ 
T JO 0 


P9 2 
-f gA cosh +x)dà] dv. 
0 


: OO" ind 1 [m -8 
Using the result e cos à dà = 5 —e 4 , one finds 
0 a 


u(x,t) = : i Foje- E do- f foe = dv 
j ~ /aKt o 0 ` 


M EO” 
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v= v + 
Finally letting z 2) =o in the first integral and or) 


= w in the second one, 


u(x,t)= 


f ef (Qw/Kt +x) do — f. 


x 


2Jkt 2/K«t 


ef (Qw/ ct — z , 


Es 
Jr 


[answer] 
Moreover, in the case that the initial temperature f(x) = uo, which equals a constant, 


a ae x 
u(x,t)= 7 is g” do- f, e” in| = (+) [ e? do, 


N 


we have 


wal a] e eal a e 
VT) Jo 2/Kt 


2 X 2 
Here the error function is defined as erfx = Ta / e ° dw, whose value is tabu- 
T JO 


lated. 


Example 5.2.9. Neutron diffusion and buckling [solution of a two-dimensional wave 
equation for a cylindrical geometry]. 


Consider a cylindrical nuclear reactor with height H and radius R. To deter- 
mine the physical size of the reactor, one requires an evaluation of the so-called 
geometric buckling B* for the reactor. This quantity arises as the eigenvalue for so- 
lution of the wave equation: 


V-o+ Bb =0. 


The solution of this equation describes the steady-sate spatial distribution of the neu- 
tron flux @ in the reactor. To design a critical reactor, one sets B? equal to the material 
buckling B? (that is determined from the material properties of the moderator and fuel 
system) (Lewis et al., 2017). The term buckling arises from the situation of a loaded 
column, where equivalently the curvature of the flux must be contained within the 
physical boundaries of the reactor. 


(a) Solve the wave equation to determine the neutron flux distribution ¢. 

(b) Evaluate the geometric buckling B? as the lowest value of the eigenvalue. 
One can further optimize the physical dimensions of the reactor by optimizing 
the buckling equation, as shown in the Lagrange multiplier problem of Exam- 
ple 14.2.1. 


el eee 
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Solution. 
A cylindrical reactor is the most common geometry where the wave equation be- 


comes 
1a(,a\i 1 a? Z a? b 12% 
r = : 
ror or r2 092 Az? ý f 


Assuming azimuthal symmetry, the separation of variables technique yields 


a(r, z) = R(r)Z(2), 
where 


+2? =0. 


1 /ƏR 10R 1 3?Z 
R \ ðr? r or Z Oz 


As follows with this solution technique, 


arg 4 1 (R 1dR , 
= d + = 
Z dz? R\dr? rdr 


where A* = a? + f*. The eigenfunction solutions of these two ordinary differential 
equations are 


Zin (Z) = A1 COS(mZ) + By sin(Q@mz) and Ry (r) = CiJo(Bar) + Di Yo(Bur). 


If the reactor is symmetric about the z = 0 plane for the origin of the coordinate 
system located at the center of the reactor, Bj = 0. Furthermore, the neutron flux 
vanishes at the boundary H /2: 


Zm(H/2) = A; cos(@mH/2) = 0, Am = mrm/H, where m = 1,3,5.... 


Since Yo goes to infinity at r = 0 from Fig. 2.4, Dı = 0. The flux also vanishes at the 
radius r = R so that 


Rn(R) = CiJo(BnR) = 9, Bn =Xn/R, 


where x, are the zeros of the Jo function in Fig. 2.4. Hence, the eigenfunctions are 


Qı (r, z) = An cos (=) Jo (=r) f 


Using the lowest values of m and n, the neutron flux distribution is given by 


answer 
R [ ] 


$ (r, z) = A cos (32) Jo CE) . 


OOOO. 
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(b) The buckling B? is given by 


2  (2.405\* 
Be=)72= (=) + (=) ; [answer] 


5.2.3 Fourier and Laplace transform methods 


Operational methods (that is, the Fourier and Laplace transforms) can be used to solve 
partial differential equations. Here the transform of the partial differential equation 
and associated boundary conditions are first obtained with respect to one of the in- 
dependent variables. The resulting equation for the transform is solved and then the 
required solution by taking the inverse transform. 


Example 5.2.10. Heat conduction equation [solution by the Fourier sine transforms]. 


Consider Example 5.2.8 for heat conduction in a semiinfinite bar where 


ðu 3?u 
— =k—, x>0,t>0, (5.43) 
ot ax? 
with boundary conditions 
u(0, t) =0, lu(x, t) <M, t>0, (5.44) 
and initial condition 
u(x, 0) = f(x), 0<x<. (5.45) 


Solution. 
One can apply a Fourier sine transform with respect to x, since x varies from 0 to co 
and then solve the ordinary differential equation. 


Letting ús (w, t) = Fs {u(x, t)} and applying the property of the sine transform (see 
Section 3.2.3), 


Fy {ul (x)} = —w? F,{u(x)} + [o u(0), 


the transform of Eq. (5.43) becomes 


Ou. 2 
ie —Kw' tis +«,/— wu(0). 
ot T 


From the boundary condition in Eq. (5.44), 


ee eee 
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which has the solution 
i (w, t) = Cwe t, (5.46) 


The transform of the initial condition in Eq. (5.45) gives 
ûs (w, 0) = f(w). (5.47) 
Thus, (îs (w, 0) = C(w)e? = f(w), which implies C(w) = f(w). Hence, 
is (w, t) = fwe, (5.48) 


Taking the inverse sine transform of Eq. (5.48) gives 


PAE e 2 
“œ= f f(w) sinwxe*” ' dw. 
0 


Inserting in the Fourier sine transform of Ô (w), 


fw) = a’ f(v) sinwvdv, 
T JO 


gives 


= = i $ —kw?t 
u(x,t) = — f(v) sinwvdv | sinwxe dw 
T JO 0 
2 [0,6] lee) 2 
= =f 1 foe] sin wv sin wx dwdv. 
T Jo JO 


By changing the dummy variable of integration from w to A yields the final solution: 


2 aan f =K o; : 
u(x,t) = z f(vje sin àv sindx dà dv. [answer] 
0 0 


As expected, this solution is identical to Eq. (5.41) as obtained by the method of 
Fourier integrals in Example 5.2.8. 


Example 5.2.11. Heat conduction equation [solution by Laplace transforms]. 


Again consider Example 5.2.8 for the heat conduction in a semiinfinite bar, where the 
initial temperature distribution is constant such that f (x) = uo. Thus, the problem is 
defined by 

ðu 3?u 


a ae x>0, t>0, (5.49) 


with boundary conditions 


u(0, t)=0, ju(x,t)| <M, t>0, (5.50) 


OOOO 
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and initial condition 
u(x, 0) = Uo, O0<x <oo. (5.51) 


Solution. 
Similarly, one can use the Laplace transform method with respect to ¢ and then solve 
the ordinary differential equation. 


Letting U(x, s) = @{u(x, t)} and using the property 
Liu (t)} =sL{u(t)} — uO), 


the transform of Eq. (5.49) becomes 


U(x, 8) —u(x,0) = dU(x,s) 
sU (x, s) — u(x, 0) =«— > 
Using the initial condition in Eq. (5.51), 
dU 
KZ — sU = —uo. (5.52) 


The homogeneous equation of Eq. (5.52) is 


&U au U 
k—— —sU=0 or A2 


Ax =0 (where iS <) F 


which has the solution 
Un(x,5) = A(s)e 7T + B(s)ez. 
The particular solution of Eq. (5.52) is 
Uo 
U,(x,s)=—. 
p(x, s) F 
Therefore, the general solution of Eq. (5.52) is 


U(x,s)=Unt+ Up = A(s)e7E + B(s)et + Ho. 
S 


Since u(x,t) must remain finite (see the second boundary condition in Eq. (5.50)), 
U(x, s) must be finite as x —> oo. As such, B = 0 so that 


—#, Uo 
U(x,s)=A(s)e T + —. 
S 


Taking the transform of the first boundary condition in Eq. (5.50) gives U (0, s) = 0 
so that U (0, s) = A + “z 0. Hence A = ee The solution of the transformed 
s s 


ee eee 
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equation is therefore 


The solution is | u(x, t) = uoerf ( [answer] 


x) 


As expected, this solution is identical to Eq. (5.42) as obtained by the method of 
Fourier integrals in Example 5.2.8. 


Example 5.2.12. Cylindrical heat conduction equation [solution by Laplace trans- 
form and inversion formula]. 


Solve the heat conduction equation for the temperature in the infinitely long circular 
cylinder for the problem given in Example 5.2.7. 


Solution. 


ðu 
From Fig. 5.6, the heat conduction equation ono «Vu reduces to 


ðu e 1 ðu 
=K 


ae 572 ra), 0<r<1,t>0, (5.53) 


with boundary and initial conditions 


u(1,t) =0, |u(r,t)| < M and u(r, 0) = uo. (5.54) 
s : . du u 1du , 
Replacing «t by t in Eq. (5.53) yields = + , and taking Laplace trans- 
ət Ər? rər 
@?U PEOS 
forms, sU — u(r, 0) = —~ + ——., which implies 
“— dr? rdr 
Uo 
U 1dU 

sU =—Up. (5.55) 


OOOO. 
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U  1dU 
The solution of the homogeneous equation of Eq. (5.55), T2 +- — sU =0, is 
r r dr 
given by the Bessel function: 
Un(r, 8) = c1Jo (i/sr) + c2Yo (iVsr). (5.56) 
The particular solution is 
Uo 
Up, s)=—. (5.57) 
s 
Therefore, the general solution is 
u 
U(r, 8) = Un + Up = c1 (s)Jo (i sr) + €2(8) Yo (ir) + — (5.58) 


Since Yo (i./sr) is unbounded as r — 0 (see Fig. 2.4), c2 = 0 and Eq. (5.58) be- 
comes U(r, s) = c1Jo (i./sr) + ta. Now, from the boundary condition, U (1, s) = 0, 
S 


—Uo 


————.. Therefore, U (r,s) = 
sJo (ivs) m 


which yields c1Jo (i/s) + = = 0 so that cı = 
S 


Uo Uo Jo (i./sr) 
s s Io (ivs) ` 


By the inversion formula in Eq. (3.9), u(r, t) = uo — 


Uo i eI (i /sr) 

— a A ds. 
2ri y—ioo sJo (ivs) 

Using the residue theorem in Section 3.1.6, the function Jo (i v5) has simple poles, 
where i./s = A1, A2, ..., An in which A, is the nth root of Jo. Thus, the integrand has 
simple poles at s = —2, n=1,2,3,..., and at s = 0. The residues at these poles are 
the following: 


Residue of integrand at s = 0: 


lim =1. 
So mE) 
Residue of integrand at s = —A2: 
jm DEMOED hy (EHD) im [TREM] 
so sJo (ivs) s—>—ì? Jo (i V5) s—>—ì2 S 


I 
5 


i | 1 en (nr) | 
s>—2 Eis i —12 
An Jo (ivs) AT 


7 | 2e~*n' Jo (Anr) | 
E Àn J (Àn) ` 


el eee 
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In the derivation of the last line, l’ Hopital’s rule has been used in evaluating the limit 
and Jo’(u) = —J, (u). Then, 


oo oo 
Aa ‘To (nr) ew fat Jo e ^n Jo Anr) 
u(r, t) = uo ufi J- | =2u a i 
= iG = Telia 


Replacing t by «t yields the required solution: 


X JolAnr) 32 
u(r, t) = 2u SEN AT, [answer] 
? 3 Andi (Àn) 


As expected, this solution is identical to Example 5.2.7 as derived using a Fourier- 
Bessel series. 


Example 5.2.13. Surface heat conduction into a flowing fluid [solution by variable 
transformation]. 


An infinitely wide flat plate is held at a constant temperature Tọ. The plate is im- 
mersed into a laminar flowing fluid that has a constant density and thermal conduc- 
tivity k with a bulk temperature Tı. With the origin taken at the edge of the plate 
(see Fig. 5.7), the velocity distribution is Vy = By, Vy = 0, and V; = 0, where £ is a 
constant. Molecular transport of heat is assumed to only occur in the y direction. 


(a) Determine the temperature distribution within the fluid. 
(b) Determine the heat transfer coefficient between the fluid and plate. 


„<“ To on surface 


Tı Te eee Vox By 
Fluid direction 


Vj=V=0 
FIGURE 5.7 


(a) Schematic of a fluid flowing over a flat plate. (b) Volume element of the fluid (the depth 
along the volume element is taken as unity). 


Solution. 
A heat balance over the volume element at steady state gives 


Input = conduction from below and enthalpy flowing in from the left 


nn I 
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oT 
= —kðx. 1 -gy tOr Voy: 1-(T—Tref), 
Y. 


Output = [conduction from below plus conduction in the element] 
+ [enthalpy flowing in from the left and enthalpy in volume element] 
oT oT 
= — | kox— + ð | kdx— | | +CpVxpðy(T — Tref) 
dy dy 
+ 0[Cp Vx pdy(T — Tref)], 


where T is the temperature of the fluid at x and y and Cp is the specific heat. Since 
the accumulation at steady state is Accumulation = Input — Output = 0, we have 


oT 
ð (a) = ð0[CpVxpðy(T — Tref) 
y. 


or 


= e) = 2 (CpVxpyT). 
Given Vy = By and £, k, Cp, and p are constants, one obtains the partial differential 
equation for the following problem: 
dT APT 
Bx y By? 


where A = . This problem is subject to the following boundary conditions: 


pP p 
(i) leading edge of plate: x = 0, y > 0,T = Ti, 
(ii) above the plate: x > 0, y = œ, T = T}, 
(iii) on the plate: x > 0, y = 0, T = Tp. 


-T 
7 , the differential equation becomes 
1 


T 
With a change of variables, 7 
A30 30 
y dy2 Ax’ 
The boundary conditions are therefore: (i) x = 0, y > 0, T = Tı, 0 = 0, (ii) x > 0, 


y=, T = T, 0 =Q, and (ii) x > 0, y = 0, T = Tọ, 0 = 1. Moreover, given these 


boundary conditions, one can assume a solution of the form 6 = f ( =) = f(n) 


with boundary conditions (i) 6 = 0,x = 0, y > 0, which implies 7 = ov, and (ii) 
0 =1,x >0,y=0, which implies 7 = 0. 


d0 dn _—nyd@ _ —nn d0 00 


For this transformed equation, = = = and = 
əx dnox  x”+l dny x dn dy 
d0 ð 1 do 30 
e a —. Taking the derivative again for the latter equation, —5 = 
dn oy x” dn dy? 


ee eee 
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1 3 dð 1d76dn 1 d’0 
x" dydn  x"dn2 dy x” dn? 
—nnd@ A 1 d’0 

dn = y x2n dn?” 


. Thus, substituting these expressions into the 


differential equation yields Since 7 = yx”, this equation 


becomes 


—nnd@_ A 180 
x dn x3" ndn? 


Setting n = 1/3 so that the factor x cancels on both sides of the equation gives the 
ordinary differential equation 


PO do | 


er ) 
dn? 3Adn 


do d 2d 
Setting — = v and separating variables gives f es f dila On integrating, 
dn v 3A 
dé 3 3 
the solution is v = da = Be~7/A) or i d0o=B f} et OM dn, Integrating again 


gives0=B f Ka en /OA g n. Applying the boundary condition that 6 = 1 when n = 


0, the constant B is evaluated as B = . Therefore, the final solution 


in eT /OA) dn 
is 


EEK.: 
7 T-T, fe 1/04 dy 
~T—-T JSE e-P/O4) dn 


(b) The heat transfer coefficient h at the surface of the plate can be determined by 
equating convection from the surface to conduction through the fluid: 


oT 
nm- m= (25) : (5.59) 
dy y=0 
3 
dT 90 30 1 dð  Be™™/0A __. 
From part (a), ay = ay te — Tı], where By = a dn = ye Using the 


expression for B from part (a) and substituting these two derivatives into the heat 
balance in Eq. (5.59) gives 


-k  e7/@A) 
~ aI fE eO dy 


h 


M UU 
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Integrating and evaluating the heat transfer coefficient at the surface of the plate y = 
n=0, 


1/3 
h =0.43k (=) f 
kx 


Problems 


5.1 Consider a solid ball of radius R, with a constant thermal diffusivity «. Initially, 
the ball has a uniform temperature uo. If this ball is dropped into an ice-water bath 
of temperature zero, show that the temperature distribution u(r, t) in the ball can be 
described by the following partial differential equation when only radial heat conduc- 


— ; ðu 1 ð ( ,0u ; ee is 
tion is considered: =K r , With the initial condition u(r, t) = uo, 
ot r2 or or 


0 <r < R, t = 0, and boundary conditions u(r, t) is finite at r = 0, t > 0, and 
u(r, t) =0,r=R,t > 0. 
Using the transformation € = ur, show that the partial differential equation becomes 


ð 
= =k subject to an initial condition &(r,t) = uor, O0 < r < R, t = 0, and 
boundary conditions é (r, t) = 0, r = 0, t > 0, and (7, t) = 0, r = R, t > 0. Solve 


this transformed problem for é (r, t). What is the corresponding solution for u(r, t)? 


5.2 A chemical pollutant is released into the atmosphere, where it diffuses radially 
with a constant diffusion coefficient D. It can be assumed that the initial concentra- 
tion of the pollutant is zero, where the pollutant is a point source of strength S (atoms 
s—!). The diffusion equation for the pollutant concentration C (r, t) (atoms m-°) is 


given by 
aC 1 0 f ,0C 
=D r ; 
ot r? or or 


The source condition can be represented as a boundary condition at r = 0. Using 


aC 
Fick’s first law of diffusion, where the flux F = — D —, the release rate of the pol- 
lutant (at r = 0) is equal to the source strength such that lit (4nr?F ) = S. The 
r= 


other conditions follow for the given physical problem: C = 0 as r —> oo, t > 0, and 
C=0,0<r<œ,t=0. 

(a) As proved in Problem Th with the transformation € = Cr, the diffusion 
equation reduces to - = a Using this transformation, show that the ini- 
tial and boundary conditions become &(r,t) =0, 0<r < R, t = 0, and (r,t) = 
——, r= 0, t > 0, and (r,t) is finite as r > œœ, t > 0. Using a Laplace trans- 


form method, show that the solution, subject to the given conditions, is given by 


D 


Problems 159 


S r 
&(r, t) = ——erfc (=). where the latter function is the complementary error 
4r D 24 Dt P 7 


function. 

(b) What is the corresponding solution for C (r, t)? 

(c) Briefly mention why the method of separation of variables is inappropriate for 
this given problem. 

(d) Assuming a diffusivity D = 107? m~? s7! and a source strength S = 
100 atoms s~!, what is the concentration at a distance of 100 m from the point source 
after 104 s? 


5.3 Consider the radial diffusion of material in a sphere of radius a, with a con- 
centration distribution C (r, t) and constant diffusivity D. It is assumed that: (i) the 
concentration at the center of the sphere is finite for t > 0, (ii) the surface of the 
sphere is maintained at a zero concentration for t > 0, and (iii) initially there is a 
uniform concentration Co throughout the sphere 0 <r <a. 


. . r Dt Cr C 
Defining the variables x = —, tT = —-, and u = — — = —x, the transformed prob- 
a a? Coa Co 
. ðu u oe 
lem is — = Pa) with conditions u(x,t) =x,0 <x < 1, t =0, u(x,t) =0, x =0, 


t > 0, and u(x,t) =0,x =1,1t>0. 
Using the Laplace transform method with respect to the variable t, such that 
U(x,s) = L{u(x,t)}, show that the transformed solution is U(x,s) = 


1 sinh x./s 
ll 5b ashe ENE 
sinh V/s 


(a) The release fraction F(t) can be defined as the total amount of material which 
has diffused through the surface of the sphere at time t divided by the initial amount 


t 
4ra? f J (tdt 
0 


| What are the corresponding solutions for u(x,t) and C(r, t)? 
s 


of material in the sphere, F(t) = 


t 
= / J(t)dt. The flux of 
—ma°Co aCo Jo 
3 
oC (r,t) 


ðr r=a 


3 T T 
Hence, show that F(t) = zs. | J(t)dt = -3 f (=) dt, and defining the 
DCo Jo 0 x=1 


material J (t) is evaluated from Fick’s law of diffusion: J (t) = — D 


x=1 

(b) Using the solution for U(x, s) and the latter expression for F (s) from part (a), 

coth/s 1 

show that F (s) = 3 a — =| 

(c) An infinite series will eventually result for the release fraction F(t) which is 
more difficult to evaluate. However, an analytical form for F(t) is possible by con- 
sidering a “short-time” approximation for the condition that t << 1. Show that this 
condition is equivalent to s >> 1 in Laplace transform space. Determine an analytic 


OOOO. 
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expression for F(t) by applying the condition s >> 1 and taking the inverse trans- 
form of this resultant expression. 


5.4 A standard model for describing the dispersion of chemical pollutants in the 
atmosphere is the Gaussian plume model. In this model it is assumed that the wind 
carries the pollutant in the x direction with an average wind speed u, while the mate- 
rial diffuses in the other directions. In the case of a plane source of pollutant at x = 0, 
assuming only diffusion in the y direction (with a diffusivity D), the steady-state 


E 
transport equation for the concentration C is u ag De Using the transforma- 
x y 
2 
tion t = x D/u, this equation can be written as Or ye’ subject to the conditions 
T y 


C = Sd(y), ~œ < y < œ, T =0, 


aC 
C is finite at y = 0, which implies ay =0, y=0,Tt > 0, 
y 


C=0, y=+o00, t>0. 


The condition at t = 0 accounts for the plane source S of material being released into 
the atmosphere (that is, at x = t = 0). Here 5(y) is a so-called distribution function 


[00] 
which has the property hat f ôl(y—a) f (y)dy = f(a) and f Sd(y)dy=S. 
CO =09 


(a) Using a Fourier cosine transform method such that Ĉ.(w, T) = Fe{C(y,T)}, 
show that the transformed partial differential equation is 


IĈ. (w, A [2 ac 

COUT) a Gai ae . 
OT TT oy y=0 

(b) Using the given conditions and the property of the distribution function, show 


= 
=w T 


e 
~V 2 


(c) Taking the inverse Fourier cosine transform of the expression in part (b), show 


that the transformed solution in part (a) is C (w,t)= 


S 
that the final solution is given by C (y, t) = aa (41). Why is the method of 
IT 


separation of variables inappropriate for this problem? 


5.5 The model in Problem 5.4 can be generalized further to account for diffusion in 
both the y and z directions, with a time-dependent diffusive-convective equation for 
the concentration of the pollutant in the atmosphere C(x, y, z) (atoms m~?) and point 
source of pollutant S (atoms s7!). If the movement of the effluent in the x direction 
from the wind (with a constant speed u m/s) is much greater than that of diffusion, 
vc ac ac 
ee oe h ), 
where D is the diffusion coefficient (m? s~!). For steady-state conditions, assuming 
C 8C 8C 
ax dy = 3z? 


0 
the following isotropic diffusion equation applies: a. D ( 


D = 1 m? s™! and u = 1 m/s, this equation simplifies to 


D O 


Problems 161 


(a) Separating the variables where C(x, y, z) = C1 (x, y) - C2(x, z), show that two 
aC, 8C 3C 8O 
= and = : 
dy? ax 3z? 
(b) Because of the similarity of these two equations, one need only consider a 
solution for one (since the other follows by symmetry). The boundary conditions 
for the given y region are Cı(x, y) < M at y = 0, where M is a constant since 


‘ : : fhe Ad oth ac 
the Cı concentration must remain finite at y = 0, which implies that = 


y y=0 

0, x > 0. The second boundary condition Cı(x, y) =0, y= œ, x > 0, fol- 
lows since far away from the source, the concentration must approach zero. Using 
a Fourier cosine transform method such that c(x, w) = F.{Ci(x, y)}, show that 


partial differential equations result: 


A 
the solution of this problem is C(x, y) = EL exp l-2 l, where A, is an arbi- 
X x 


trary constant. Note the following definite integral has been used in this derivation: 


PS da 1 Jm _2 
e cos (bw)dw = =,/—e %. 
Tð 2Va 


(c) The solution for the z direction follows on replacing the variable y with z 


in the solution of part (b) because of the symmetry of the problem C2(x, z) = 
2 


A 
fad! exp ae . The final solution for C(x, y, z) therefore follows as C(x, y, z) = 
f/x 4x 


A 1 
Ci(x, y) - Co(x,z) = fal- p+}, where A = A, - A2. The arbitrary 
x x 


constant A follows from a conservation of mass using the source condition for the 
pollutant at x = 0. Hence, evaluate A by integrating the flux of material [u -C (x, y, z)] 
(where u = | m/s) over all space in the y and z directions and equating this result 


[0.0] OO 
to the quantity S: im f I [1- C(x, y, z)]dydz = S. Hint: Note the definite 
x VU J—oo J—00 


[0.0] 
integral result f ede = =, This analysis therefore yields the steady-state 
a 


—oo 
Gaussian plume model for the dispersal of a pollutant with diffusion in the y and z 
directions and convective transport by the wind in the x direction. 


5.6 Consider heat conduction in a semiinfinite slab, where the slab is initially (t = 0) 
at a constant temperature um and the end of the slab (at x = 0) is maintained at 
the constant temperature uw. The partial differential equation for the temperature 
distribution u(x, t) for thermal diffusivity « is 


ðu 3u 


San a)? 0, t 0, 
a K z x> > 
with u(x, 0) = um, x > 0, u(0, t) = Uy, t > 0, |u(x, t)| < M (that is, u is finite as 
x — œ, t > 0). Using a Laplace transform method: 
can bene, 


(a) Show that the transformed solution is U(x, s) = (Uw — Um) í + 
s 


OOOO... 
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-a/s 
(b) Using the inverse Laplace transform #7! £ = erfc (5) show 
s 2/t 


that the solution is 
u(x,t) = ‘ua 5)} = (Uy — Um )erfc +u 
, , w m hot m: 


5.7 A problem of interest to chemical and material engineers is the melting of a ma- 
terial. This problem can be solved as a moving-boundary heat conduction problem as 
a so-called “Stefan problem,” where the melt front (that is, the liquid/solid interface) 
progresses with time. Consider a slab of material which is initially at the constant 
temperature um at t = 0. At time t > 0, the temperature at the wall (x = 0) is raised 
to a value above the melting temperature of the material, uw. At the liquid/solid in- 
terface, x = s(t), the temperature is equal to um. These boundary conditions can be 
written mathematically as 


u(0, t) = uw, t>0, 


u(x, t)|x=s(7) =Um, t>0. 


As shown in Problem 5.6, the general solution of the heat conduction equation is of 


the form A + Berfc == , where A and B are constants determined from the 
2/7 Kt 


boundary conditions. 


(a) Show that by applying the boundary conditions, the solution of this moving 


x 
u(x,t) —Uuy 7 ert( =) 


mT Uw 7 erf(y) 
tion for s(t) indicates the position of the moving liquid front. 


boundary problem is , where s(t) = 2y /xt. The rela- 


(b) Assuming that heat transfer occurs in the liquid phase by conduction only, 
the constant y can be determined from an energy balance across the interface: 


d d 
-K es = tL. Using the definition of the thermal conductivity K, show 
dx | = s(1) dt 
: du ds i ; 
that this energy balance becomes —ko — = L—. Here t is the density, 


x x=s(t) 
k is the thermal diffusivity, ø is the specific heat for the liquid phase, and L is 


the latent heat of melting/solidification. Show that by applying this condition with 


the solution for u(x,t) one obtains the following transcendental equation for y: 

o (Uw —U 

Eu a As shown in Problem 9.2, various approximate meth- 
L/a 


ods can be used to numerically solve this transcendental equation. 


ye” erf(y) = 


(c) To numerically solve for y in part (b), one requires that the error function 
2 Y 
erf(y) = — / ge du be evaluated. Using a Gaussian integration rule (with n = 
T 


2 terms) (Section 9.5.3), derive an analytic relation for erf(y). 


so OOE O 
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5.8 Give a brief physical interpretation of the following partial differential equa- 
tions. Classify each equation as elliptic, hyperbolic, or parabolic by comparing it to 
the general form of a linear partial differential equation with two independent vari- 


ables. ; ; 
(a) 3 + = =0. 
ðu Ə?u 
(b) ar =K a 
(c) ae = oe 


5.9 Consider the heat conduction in a thin insulated bar of length 3 m where the 
initial temperature at t = 0 is f(x) = 15 — 10x°C and the ends of the bar are kept 
at 0°C. The partial differential equation for the temperature distribution u(x, t) at the 
distance x and time f in the bar is therefore given by S = oo Here the thermal 
x 
diffusivity is simply equal to 1 m* s~!. The boundary conditions for this problem 
are given mathematically by u(0, t) = 0, t > 0, u(3,t) = 0, t > 0, and the initial 
condition is u(x,0) = 15 — 10x, for 0 < x < 3, t = 0. Solve this problem using a 
separation of variables technique. 


5.10 Solve Problem 5.9 using a Laplace transform method with the variable trans- 
formation U(x, s) = @{u(x, t)}, where 
(a) the transformed solution for U (x, s) is given by 


sinh (x./s) m sinh ((x — 3)/s) | 15— 10x 
s sinh (3,/s) s sinh (3/5) f 


(b) By taking the inverse Laplace transform of U (x,s) in part (a), what is the 
solution for u(x,t)? 


S 


væs=is] 


5.11 Consider the heat conduction in a thin insulated bar of length 3 m where the 
initial temperature at t = 0 is 25°C and the ends of the bar are kept at 10°C at x = 0 


and 40°C at x = 3. The partial differential equation for the temperature distribution 
2 


u 
u(x,t) at the distance x and time ¢ in the bar is given by ap gt Here the thermal 
x 


diffusivity is simply equal to 1 m? s~!. The boundary conditions for this problem are 

given by u(0, t) = 10, t > 0, u(3, t) = 40, t > 0, and the initial condition is u(x, 0) = 

25, for0 <x < 3, t = 0. To solve this complicated boundary value problem, assume 

that u(x, t) = v(x, t) + w(x), where w(x) is to be suitably determined in part (a). 
(a) Explain why u(x,t) can be written as two separate functions. Show that the 

boundary value problem can be rewritten as two separate problems: w’(x) = 0, 

2 
w(0) = 10, w@) = 40, and = = with v(0,t) = 0, v(3,t) =0, v(x,0) = 


ot ax2 
15 — 10x. 


OOOO. 
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(b) What is the solution for w(x)? The solution for y (x) is in fact needed to obtain 
the initial condition for v(x, 0). 

(c) The partial differential equation problem for the function v(x, t) is in fact the 
same boundary value problem as given in Problems 5.9 and 5.10. Using this solution 
for v(x, t), give the complete solution for u(x, t). 

(d) What is the physical significance of the solution for w(x)? 


5.12 The slowing down of neutrons in the moderator of a nuclear reactor can be 
described by the slowing down density q (neutrons m~? s7!) in accordance with the 
ðq ə? 
so-called Fermi age equation, = = = where T is the “age” of the neutron (in this 
T x 
theory, the unit of t is m°). This equation can be solved assuming a plane source 
of neutrons S (neutrons m~? s7!) in the yz plane at the origin of the coordinate 
system, where x is the distance from the plane source. The boundary conditions for 
this equation consider symmetry at x = 0 and require that: (i) q remains finite such 
ð age ; 
that af = 0, t > 0, and (ii) q is finite as x > œ, t > 0. 
X |x=0 
(a) Using a Fourier cosine transform on defining Q (w, t) = Fe{q(x,T)}, show 
2 
that this problem yields the transformed solution: Q(w,t) = A; e~” *, where A, is 


an arbitrary constant. 
(b) Taking the inverse Fourier cosine transform, show that the solution for q(x, t) 


. A 2 : 
is q(x,T) = ee /(41) where A is a constant. 
T 
(c) Using a source condition that follows from the conservation of mass S = 
lee) 


q(x, t)dx, evaluate the constant A. This source condition replaces the initial 
—CO 
condition for t = 0. 


5.13 Consider the solution of d’ Alembert’s for the wave equation as mentioned in 
Section 1.1.1. Here a solution to the wave equation 


u(x,t) 1 d°u(x,t) 


ax2 ceo Oot? 


is given by 
u(x,t) = F(x+ct)+ G(x —ct), 


where F(x + ct) and G(x — ct) are arbitrary functions. This solution is obtained by 
changing the variables x and t to w = x + ct and z = x — ct, respectively, considering 
that u(x,t) = u(w, z) for the partial differential equation. 


CHAPTER 


Difference numerical 
methods 


Finite difference methods can be used to numerically solve first- and second-order 
ordinary differential equations (Section 6.1), as well as partial differential equa- 
tions (Section 6.2), particularly when analytical methods are not feasible. This is a 
very common method used for the solution of engineering problems that commonly 
arise in real-world situations where the coefficients of the differential equation them- 
selves may be a function of the dependent variable. For the solution method, there 
is a discretization of the differential equation using finite differences to approximate 
derivatives. This method essentially represents three (or less) spatial dimensions and 
perhaps a time dimension as arrays of mesh points, defined from a mesh spacing that 
is uniform or variable in size. This method can be alternatively compared to the finite 
element numerical method as described in Chapter 7. Numerical techniques devel- 
oped here can also be used for the solution of nonlinear partial differential equations 
in Chapter 12 or utilized as an approach to linearize nonlinear equations. 


6.1 Ordinary differential equations 
6.1.1 First-order equations 


Numerical methods can be used to solve the initial value problem 


y =f, y), y(xo) = yo. (6.1) 


One-step methods 
The problem in Eq. (6.1) can be solved by truncating the Taylor series 


h2 
yx +h) = y(x) + hy) + q7 t+ eo (6.2) 
D Wa ,_ of of 1 
where by Eq. (6.1), y = f, y” = f= ax + ay , and so on. In a one-step method, 
x y ——— 


one starts from yọ = y(xo) and proceeds stepwise computing approximate values of 
the solution using a Taylor series approximation at given “mesh points”: 
xı = Xo +h, x2 = xo + 2h, x3 = xo + 3h, ..., (6.3) 
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where A is a fixed step size. 
(i) Euler method 


Truncating Eq. (6.2) after the term hy’ yields the Euler method in which one com- 
putes in a step-by-step method: 


yı = yo thf (Xo, yo), 
y2 = yı +Af (x1, y1), 


Yn+1 = Yn + hf (Xn, Yn) (n=0, 1,...). (6.4) 


This method is of first order since only constant terms are considered with a term 
containing the first power of h. The truncation of the series in Eq. (6.2) results in a 
local truncation error of order h2. In addition, there are round-off errors in this calcu- 
lation and other methods. 


Example 6.1.1. Solve the initial value problem y’ = x + y+1, y(0) =0for0<x <1 
with h = 0.2 using an Euler method. 


Solution. 
Here f(x, y)=x-+y-+1 and Eq. (6.4) becomes 


Yn+1 = Yn + 0.2(xn + Yn + 1). 


The analytical solution is obtained with an integrating factor of e~* as derived from 
the methodology in Chapter 2, yielding y(x) = 2e* — (x +2). The numerical solution 
using this step-by-step equation is given in Table 6.1. 


Table 6.1 Example calculation for the Euler method. 


n Xn Yn 0.2(xn + yn +1) Exact value Error 
6) 0.0 0.000 0.200 0.000 0.000 
1 0.2 0.200 0.280 0.243 0.043 
2 0.4 0.480 0.376 0.584 0.104 
3 0.6 0.856 0.491 1.044 0.188 
4 0.8 1.347 0.629 1.651 0.304 
5 1.0 1.977 2.437 0.460 


(ii) Heun’s method 


Taking one more term into account in Eq. (6.2) yields the Heun’s method, which 
avoids the computation of calculating the derivatives of f with the use of an “auxil- 
iary” equation, that is, 


Yal = yn thf (Xn, Yn), (6.5a) 
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1 
Yn+1 = Yn + FALF (Ans Yn) + Fn» Yntvl- (6.5b) 


This method is a predictor-corrector method because in each step a value is predicted 
in Eq. (6.5a) and then corrected in Eq. (6.5b). The truncation error per step for this 
method is obtained as follows. Setting fn = f (xn, y(xn)) and using Eq. (6.1) and 
Eq. (6.2), 


sg 1 i 1 = 
y(Xn +h) — yn) = h fa + ah hh + gh ft FE eas (6.6a) 
and it follows that 


Yn+1 — Alfa + faril = hl fa + (fa +h fy + r at (6.6b) 
Subtracting Eq. (6.6a) from Eq. (6.6b) gives the truncation error per step 


3 3 
eye = it = h” Fn 4. 
J -5 ph PP 


Thus, Heun’s method is a second-order method because the local truncation error is 
of the order h?. 


Example 6.1.2. Solve the problem in Example 6.1.1 by Heun’s method. 
Solution. 


Letting kı = Af (Xn, Yn) = 0.2 (xn + Yn + 1) and k2 = hf (xn+1, Yn + k1) = 0.2 (xn + 
0.2 + yn + 0.2 (xn + yn + 1) + 1), Eq. (6.5b) becomes 


1 0.2 
Yn+1 = Yn + z% +k) = yn + y 2n +2.2yn + 2.4) 
= yn + 0.22 (xn + yn) + 0.24. 


The results in Table 6.2 are more accurate than those in Table 6.1. 


Table 6.2 Example calculation for the Heun method. 


n Xn Yn 0.22 (xn + Yn) + 0.24 Exact value Error 

0 0.0 0.0000 0.2400 0.0000 0.0000 
1 0.2 0.2400 0.3368 0.2428 0.0028 
2 0.4 0.5768 0.4549 0.5836 0.0068 
3 0.6 1.0317 0.5990 1.0442 0.0125 
4 0.8 1.6307 0.7747 1.6511 0.0204 
5 1.0 2.4054 2.4366 0.0311 
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(iii) Runge—Kutta method 


Truncating after the term in h* gives the Runge-Kutta method. Again, in this method, 
the calculation of the derivatives in Eq. (6.2) is replaced by the evaluation of f (x, y) 
at suitable points (x, y) with the following use of four auxiliary quantities: 


kı = hf (Xn, Yn), 


1 1 
k2 =hf (Xn + ai Yn + 3%) 


1 1 
k3 =hf (Xn + ais Ynt z2) 


ka =hf (xn +h, yn + k3), (6.7a) 


and then from these quantities we have the new value 


1 
Yn+1 = Yn + 6 (kı + 2k2 + 2k3 + k4). (6.7b) 


Example 6.1.3. Solve the problem in Example 6.1.1 by the Runge-Kutta method. 


Solution. 
For the present problem f(x, y) =x + y + 1 and h = 0.2. Therefore, Eq. (6.7a) gives 


ki = 0.2(xn + yn + 1), k2 = 0.2(xn + 0.1 + yn +0.5kı + 1), 
k3 = 0.2 (xn + 0.1 + yn +0.5k2 + 1), k4 = 0.2(xn + 0.2 + yn + k3 + 1). 


The results using Excel analysis in Table 6.3 are more accurate than those in 
Table 6.1 and Table 6.2. 


Since the error is of order h> for this method, in a switch from A to 2h, the error is 
multiplied by 2° = 32. However, one needs only half as many steps as before so that 
the error will only be multiplied by 32/2 = 16. In fact, the error € of an approximation 


y obtained with step h equals about 1/15 the difference 6 = y — y of corresponding 
approximations obtained with steps h and 2h, respectively: 


c= 75 (5-5). (6.8) 


Multistep methods 
The previous methods are known as one-step methods because each step uses only 
values obtained in the preceding step. In contrast, methods that use values from more 


than one preceding step are called multistep methods, which avoid computations such 
as Eq. (6.7a). 


Table 6.3 Example calculation for the Runge-Kutta method. 


n | Xn Yn ky ko k3 k4 l(k + 2k +2k3 +k4) | Exact Error x 10° 
O | 0.0 | 0.000000 | 0.200000 | 0.240000 | 0.244000 | 0.288800 | 0.242800 0.000000 | 0 

1 | 0.2 | 0.242800 | 0.288560 | 0.337416 | 0.342302 | 0.397020 | 0.340836 0.242806 | 6 

2 | 0.4 | 0.583636 | 0.396727 | 0.456400 | 0.462367 | 0.529201 | 0.460577 0.583649 | 14 

3 | 0.6 | 1.044213 | 0.528843 | 0.601727 | 0.609015 | 0.690646 | 0.606829 1.044238 | 25 

4 | 0.8 | 1.651042 | 0.690208 | 0.779229 | 0.788131 | 0.887835 | 0.785461 1.651082 | 40 

5 | 1.0 | 2.436502 2.436564 | 61 
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By integrating cubic spline polynomials (see Section 9.3), one can obtain the impor- 
tant Adams—Moulton multistep method (Kreyszig, 1993). In this method, one first 
computes the predictor 


h 
Yay = Yn + 74 (55 fn — 59 fn—1 + 37 fn-2 — 9 fn—3) , (6.9a) 


where f; = f (xj, yj), and then from it the corrector (the actual new value) 


h 
Ynti = Yn + 3F (9 Fa E19 fa Shea + fa-2); (6.9b) 


where fi", = f n+1, Y;41)- To get started, y1, y2, and y3 must be computed by the 
Runge-Kutta method or some other method. This method is like the Runge-Kutta 
method that is of fourth order; however, it is faster because only two new values 
of f are needed per step in Eq. (6.9a) and Eq. (6.9b), as opposed to four values in 
Eq. (6.7a). Predictor-corrector methods can also provide an estimate of the error. If 
|Yn — y;| is large, then the error of y, is probably large and h should be reduced. On 


the other hand, if y, is small, then h may be increased (e.g., doubled in size). 


Example 6.1.4. Solve the problem in Example 6.1.1 on the interval 0 < x < 2 by the 
Adams—Moulton method with h = 0.2. 


Solution. 

Using Eq. (6.9a) and Eq. (6.9b) and the Runge-Kutta results in Table 6.3 for the 
starting values y1, y2, and y3, the results are shown in Table 6.4 using Excel analysis. 
It is seen that the corrections improve the accuracy considerably. 


6.1.2 Second-order equations 


Numerical methods can be used to solve the initial value problem 


y" = f(x,y, y), y(%o) = Yo, y (xo) = Yh, (6.10) 


such as the Runge—Kutta—Nystr6m method. This is a fourth-order method that is an 
extension of the Runge-Kutta method in Section 6.1.1. Again, one computes four 
auxiliary quantities: 


1 
kı = ah f On, Yn, y), 


1 1 1 1 
ka= hf Gn + zh Yn + K, Yn + kı) where K = 5On + 5h), 


1 1 
k= zS Gn F ait Yn+ K, y, +k), 


Table 6.4 Example calculation for the Adams—Moulton method. 


n Xn Starting yn Ín yn ifs Corrected yn Exact Error x10° 
(0) 0.0 0.000000 1.000000 0.000000 0 

1 0.2 0.242800 1.442800 0.242806 6 
2 0.4 0.583636 1.983636 0.583649 13 
3 0.6 1.044213 2.644213 1.044238 25 
4 0.8 3.451056 1.650720 3.450720 1.651056 1.651082 26 
5 1.0 4.436537 2.436129 4.436129 2.436537 2.436564 26 
6 1.2 5.640208 3.439708 5.639708 3.440208 3.440234 26 
7 1.4 7.110377 4.709766 7.109766 4.710377 4.710400 23 
8 1.6 8.906046 6.305300 8.905300 6.306046 6.306065 18 
9 1.8 11.099284 8.298373 11.098373 8.299284 8.299295 10 
10 2.0 10.777001 13.777001 10.778114 10.778112 —2 
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1 
ka= shf On +h, yn tL, y, + 2k3), where L = h(y!, + k3), (6.1 1a) 


and then from these quantities the new approximate value y„+1 of the solution y, 


1 
yn = nth (y+ 30k +k), (6.11b) 


as well as an approximation of the derivative y’ needed in the next step, 


1 
Yn+1 = In + z (kı + 2k2 + 2ks + ka). (6.110) 


Example 6.1.5. Solve the initial value problem y” = 5(x +y+y +1), y0)= 
0, y’(0) =0 for 0 < x < 1 with h = 0.2, using the Runge—Kutta—Nystrém method. 
Using the method of undetermined coefficients from Chapter 2, the particular solution 
of this differential equation is y? = —x. The solution of the homogeneous equation 
is y” = Ae* + Be~*/*. Hence, applying the boundary conditions, the exact solution 
is y(x) = $ [e* —e*/ 2] — x. The numerical solution can therefore be compared to 


this exact analytical solution. 


Solution. 
From Eq. (6.11a) we have 


kı =0.05(n + yn + y} + 1), 
1 
kp =0.05(xn +0.1 + yn + K +y, +kı +1), K =0.1(y/ + z4) 


k3 =0.05(xn + 0.1 + yn + K + yn +k2+ 1), 
k4 = 0.05 (x, + 0.2 + yr + L + y, +2k3 + 1), L= 0.2%; + k3). 


The computations based on Excel analysis are shown in Table 6.5, and compared to 
the exact solution. In this calculation, both truncation and round-off errors are present. 
The round-off error may accumulate so that it can affect the required solution (after 
many steps) resulting in a significant build-up error. 


6.2 Partial differential equations 


A partial differential equation is quasilinear if it is linear in the highest derivative. For 
example, a second-order quasilinear equation is 


aUuxx + 2buxy + Cuyy = F (x, y, U, Ux, Uy), (6.12) 


where u is an unknown function of the independent variables x and y. This equation 
can be further classified as 


Table 6.5 Example calculation for the Runge—Kutta—Nystr6ém method. 


i | 88) |) 3% y, kı k2 k3 k4 I IT Exact Error 
O | 0.0 | 0.0000000 | 0.0000000 | 0.0500000 | 0.0576250 | 0.0580063 | 0.0663807 | 0.0110421 | 0.1158811 | 0.0000000 | O 

1 | 0.2 | 0.0110421 | 0.1158811 | 0.0663462 | 0.0754087 | 0.0758619 | 0.0858498 | 0.0376840 | 0.1515790 | 0.0110436 | 2 

2 | 0.4 | 0.0487261 | 0.2674601 | 0.0858093 | 0.0966516 | 0.0971937 | 0.1091752 | 0.0721357 | 0.1942251 | 0.0487293 | 3 

3 | 0.6 | 0.1208617 | 0.4616852 | 0.1091273 | 0.1221650 | 0.1228168 | 0.1372540 | 0.1159443 | 0.2454483 | 0.1208671 | 5 

4 | 0.8 | 0.2368060 | 0.7071335 | 0.1371970 | 0.1529355 | 0.1537224 | 0.1711778 | 0.1710170 | 0.3072302 | 0.2368139 | 8 

5 | 1.0 | 0.4078231 0.4078341 | 11 
Note: 


Column I =0.2[y), + (ki + ke + k3)/3]. 
Column IT = (kı + k2 + k3 + k4)/3. 
Error = (Exact — yn) x 10°. 
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(i) elliptic if ac — b? > 0 (Laplace equation), 
(ii) parabolic if ac — b? = 0 (heat equation), 
(iii) hyperbolic if ac — b? < 0 (wave equation). 


Here the coefficients a, b, and c may be functions of x and y. A numerical solution is 
obtained by replacing partial derivatives for the above types with difference quotients 
as shown in the following sections. 


6.2.1 Poisson and Laplace difference equations 


The Poisson equation is given by 


V?u = uxx + uyy = f (x, y). (6.13) 


If f(x, y) = 0, Eq. (6.13) reduces to the Laplace equation 


V?u = uxx + Uyy = 0. (6.14) 


For a numerical solution, consider the following Taylor series formula: 
12 
u(x +h, y)=u(x, y) + hux(x, y) + ait Uxx(X,y)+..., (6.15a) 
1 
u(x — h, y)=u(x, y) — hux(x,y)+ sux, y)+.... (6.15b) 
Adding Eq. (6.15a) and Eq. (6.15b) and neglecting higher-order terms, 
u(x +h, y) +u — h, y) = 2u(x, y) + huss Œ, y), 
or 
1 
uxx (x, y) = zz lee +h, y) — 2u (x, y) + u(x — h, y)]. (6.16a) 


Similarly, 


1 
uyy (x, y) > Z [u(x, y +k) — 2u(x, y) + u(x, y — k)]. (6.16b) 


Substituting Eq. (6.16a) and Eq. (6.16b) into Eq. (6.13), one obtains the Poisson 
difference equation for k =h, 


u(x +h, y) tux, y +h) tule —h, y) tux, y —h) — 4u(x, y) = h? f (x, y). 
(6.17) 
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Similarly, the Laplace difference equation is 


u(x +h, y)+u(x,y +h) +u(x —h, y) +u(x, y—h) — 4u(x, y) =0.} (6.18) 


Here h is the mesh size. As a convenient notation for the mesh points (and the corre- 
sponding values of the solution), let 


Pi =(ih, jh), uij =u(ih, jh). (6.19) 


Hence, for any mesh point P;;, Eq. (6.18) can be written as 


Ui+1,j + Ui, j+1 + Ui-1,j; + Ui,j—1 — 4i, j = 0. (6.20) 


Example 6.2.1. The four sides of a square plate of side 12 cm made of a ho- 
mogeneous material are kept at constant temperature 0°C and 100°C, as shown in 
Fig. 6.1(a). Using a wide grid of mesh 4 cm (see Fig. 6.1(b)), find the steady-state 
temperature at the mesh points. 


Solution. 
For steady-state conditions, the heat conduction equation reduces to the Laplace 
equation and Dirichlet problem (see Example 5.2.6). 


Y 4 u=100 u = 100 
4 4 


O 


12 


zai | | 
© © 


u=0> R <u=0 > u=0> 


FIGURE 6.1 


Heat conduction in (a) a square plate with (b) a meshed grid. 


Using Eq. (6.20) for the mesh points P11, P21, P12, and P2 in Fig. 6.1(b), one obtains 
the following system of equations. For example, for Pı (i = 1, j = 1), one has u21 + 
u12 + uo, + u10 — 4u11 = 0. However, ug; = 0 and u10 = O (from the boundary 
conditions) so that 


—4ui; + u21 + u12 =0. 


re 
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Thus, the system of equations for the four internal mesh points is 


—4uj;  +u21 +12 =0 (Pu) G@=1,j=1), 

uiy —4u2] +u22 =0 (Px) G@=2,j7=1), (6.21) 
u —4ui2 +u =-—100 (P2) (i=1,j=2), 
ur t+uj2 —4u22 =-—100 (P2) ( =2, J= 2). 


This small system of equations can be solved with matrix methods with Maple in 

Appendix A, yielding u11 = u21 = 12.5°C and u12 = u22 = 37.5°C. More exact val- 

ues to one decimal place can be obtained using Fourier series (see Example 5.2.6). 

Taking f(x) = 100 in Eq. (5.29) with b = a = 12, on integrating one obtains 
4 


Ay = —— with n = 1,3,5.... Thus, the analytical solution in Eq. (5.28) 
nz sinh(nz) 
>. nmx . NTY 
400 sin —— sinh —— 
is u(x, y) = — > 12 12 . Summing the series to 10 terms with 
T nsinh(nz) 


n=1,3,5... 
Maple yields the analytical results uj, = u21 = 11.9°C and u12 = u22 = 38.1°C. 
These latter values are identical to the numerical results obtained in Example 7.1.1 
with COMSOL. [answer] 


The Gauss-Seidel (or Liebmann) method and the alternating direction implicit 
method can also be used to solve the system of equations in Eq. (6.21). 


(i) Gauss-Seidel method 


The system in Eq. (6.21) can also be written in the form 


uil = 0.25u21 +0.25u12 +0, 
u21 = 0.25u11 +0.25u22 +0, (6.22) 
ui = 0.25u11 +0.25u22 +25, 
u2 = 0.25u21 +0.25u12 +25. 


Using an iterative method, one starts from a poor approximation to the solution, 
for example, u® = 30, u = 30, u9 = 30, and u9 = 30, and calculates from 


Eq. (6.22) a better approximation. 


Use “old” values 


4 
wy = 0.25u3) +0.25u 12 +0 = 15.00, 
us; = | 0.25u $p +0.25u5) | +0= 11.25, ea 
u$} = | 0.25uiy +0.25u® | +25 = 36.25, 
u9 = 0.25us) +0.25u +25 = 36.88. 
t 


Use “new” values 
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The next step yields 


u? = 0.25uS) +0.25u} +0= 11.88, 
u® = 0.25u? +0.25u5) +0= 12.19, (6.24) 
u= 0.2542 +0.25u} +25 =37.19, 
u2 = 0.25uS) +0.25u +25 = 37.49. 


Further iteration yields the exact solution u11 = u21 = 12.5°C and u12 = un = 
37.5°C. 


(ii) Alternating direction implicit method 


The alternating direction implicit method uses the fact that the system of equations 
has a coefficient matrix that is tridiagonal, that is, the matrix has nonzero entries on 
the main diagonal and immediately adjacent to it. A tridiagonal matrix is obtained if 
there are only three points in a row where Eq. (6.20) can be written as 


Ui-1,j — 4ui,j + ui+1,j = —Ui,j—1 — Ui, j+, (6.25a) 


Ui j—1 — 4i, j + Ui, j+1 = —Ui—1,j — Uit, j- (6.25b) 


In Eq. (6.25a), the left side belongs to the y-row and the right side to the x-column 
i. The Eq. (6.20) can alternatively be written as Eq. (6.25b). Thus, using an iter- 
ative method, on can substitute the mth iteration values on the right-hand side of 
Eq. (6.25a) and solve for the (m + 1)th values on the left-hand side for a fixed row j: 


(m+1) (m+1) (m+1) _ (m) (m) 
Uj, j — 4u; j +uigi j = Up jy — ijy (6.26) 


This gives a system of N algebraic equations in N unknowns (where N is equal to 
the number of internal mesh points per row), which can be solved by Gauss elimina- 


tion. One then proceeds to the next row, and so on. In the next step, the direction is 
alternated and a next approximation is computed, is column by column, from 


the yore values and the given values for the boundary condition (BC). That is, from 
Eq. (6.25b), 
(m+2) (m+2) (m+2) (m+1) (m+1) 
ui j-i T 4u; j +u ji = iaj T i,j (6.27) 


for a fixed column i. 


For the present example, one obtains first approximations u}, u£, u, and u 


from starting values u9 = 30, u® = 30, u9 = 30, and is = 30 and the appropriate 


(d) 
22 
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boundary values in Fig. 6.1b (which can be written without an upper index). Thus, 
from Eq. (6.26) with m = 0, for a fixed j = 1 (first row), the system is 


d) (1) (0) ; 
u —4u +u =— u —u i=l), 
01 11 21 10 12 ( ) 
O(BC) 0(BC) 30 
(1) (1) (0) ; 
u —4u +u =— u —u i= 2): 
11 21 31 20 22 ( ) 


O(BC) O(BC) 30 
The solution is uf) = u£) = 10. For j = 2 (second row) one obtains from Eq. (6.26) 
the following system: 


1 1 0 ; 
uo —4u\) +u% =u) -u3 (i=1), 


1 1 0 ‘ 
u) —4us) +u32 =u —U73 (i =2). 


ve AL) C D 
The solution is Ujy =U = 43.33. 


Second approximations u®, uP, ua, and u) are now obtained from Eq. (6.27) 
with m = 0 by using the first approximations just computed and the boundary values. 
For i = 1 (first column) one obtains from Eq. (6.27) the following system: 
uio 4u +u =—u9 ~uf}  (j=1), 
ed —4u\ +u13 =—U02 =u) (j =2). 
The solution is uf? = 12.22, u = 38.89. 


For i = 2 (second column) one obtains from Eq. (6.27) the following system: 


2 2 š 

u20 4u +u =—uü]] —u3] G=D, 
2 2 1 P 

i =A) +u23 = ati —u32 (j =2). 


The solution is u$? = 12.22, uf) = 38.89. 

Thus, in the second approximation, i = i = 12.22°C and ie = u) = 
38.89°C. These values are about the same as those of two Gauss-Seidel steps in 
(1). 

Neumann and mixed problems 

In Neumann and mixed problems, there are boundary points at which the normal 


ae ðu. 
derivative u, = Sa is given but not u. 
n 
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Example 6.2.2. Solve the mixed boundary value problem (see Fig. 6.2) for the Pois- 
son equation: 


Vu = Uy + uyy = Oxy. 


Pi} P93 
4 9 9 
y un =2(x + 1) i 
J Poo 'Pi2 (P2 P32 (u=8) 
1 1 x x 
(un =3) (un =4) 
Por. Pu Pa P3; (u=5) 
u=0> R <u =4(y? +1) 0.5 e e 
0 ~ ~ 
t 1.5 x Pio P2 
u=0 (u=0) (u=0) 
0 0.5 1.0 1.5 
(a) (b) 
FIGURE 6.2 


Heat conduction in (a) a rectangular plate with mixed boundaries with (b) a meshed grid. 


Solution. 
For the grid in Fig. 6.2(b), h = 0.5. From the boundary conditions u = A(y +1) and 
Un = 2(x + 1) so that 


u31 = 4[(0.5)? + 1] = 5, u32 = 4[(1)? + 1] =8, 


du12 du12 


du22 (6.28) 
= 2[0.5 + 1] = 3, —— = 2[1 + 1] = 4. 
on dy dy 


The interior points P;; and P; are obtained from Eq. (6.17) (with h? = 0.25 and 
f(x, y) = 12xy) and the given boundary values. Thus, 


—4uj, +u21 +412 = —0 —0 +0.25. (16)(0.5)(0.5)= 1, 
—— ye No , 


—uol —u10 h? (16) (x) (y) 


uy, —4u21 +tu2 = —5+2=—3. 
(6.29) 
For the points Pı2 and P22, one extends the region R (to y = 1.5), and Eq. (6.17) 
yields 


u11 —4ui2 +u +u13 =2, 


(6.30) 
u21 t+uj2 —4un +uz3 =—8+4= -—4. 
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From Eq. (6.28), 


_ 0u12 U13 — uil 


3 =u uil = u13 = u11 +3, 
Jy z; 13 — u11 13 11 
ðU22 _ U23 — U21 

4= N =u U21 > U23 = U21 + 4. 
dy Th a= 23 = U21 


Substituting these results into Eq. (6.30) gives 


2u11 —4ui2 +u =2-3=-1, 
2u21 +u12 —4ur2 =—-4-4=-8. (6.31) 
Thus, Eq. (6.29) and Eq. (6.31) in matrix form is 
—4 1 1 0 uli 1 
1 -4 0 1 uz | | —3 
2 20 -4 ee ee ee (6.32) 
0 2 1 —4 u22 —8 


The solution of Eq. (6.32) using Maple to solve this matrix system of equations in 
Appendix A is u11 = 0.484, u21 = 1.658, u12 = 1.280, u22 = 3.149. The more exact 
values as evaluated with a complete numerical analysis with COMSOL in Exam- 
ple 7.1.2 are uil = 0.507, u21 = 1.717, u12 = 1.466, u22 = 3.269. 


6.2.2 Parabolic equations 


By an appropriate transformation, the heat equation and corresponding boundary con- 
ditions can be written as 


Ut = Uxx, O<x<1,r=0, 
u(x,0) = f(x) (initial condition), (6.33) 
u(0,t) =u(1,t) =0 (boundary conditions). 


Using Eq. (6.16a), a simple finite difference approximation of Eq. (6.33) is 


1 


1 
T (ui j+1 — ui j) = 72 (ui+i,j — Qui, j + uj-1,;) (6.34) 


for a mesh size A in the x direction and k in the t direction. Solving for u;, j+1 (which 
corresponds to time row j + 1 in Fig. 6.3), 


k 
ui j+1 = (1 — 2r)ui j +r(ui+1,j + U4i-1,;), r= a. (6.35) 


It can be shown for convergence that 


(6.36) 


ee ee Eee 
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Time row j + 1 


x x Time row j 


FIGURE 6.3 
The four points in Eq. (6.34) and Eq. (6.35). 


x x Time row j + 1 


X x Time row j 


FIGURE 6.4 
The six points in the Crank-Nicolson formulas in Eq. (6.38) and Eq. (6.39). 


The restriction in Eq. (6.36) can be avoided with the Crank—Nicolson method, which 
replaces the difference quotient on the right-hand side of Eq. (6.34) by 5 times the 
sum of two such difference quotients at two time rows (see Fig. 6.4), that is, 


1 1 
L (ui, j+1 — i,j) 72 (wi+1,j — 2ui,j + ui-1,;) 
(6.37) 
t572 (ui+1,j+1 — Zui, j+1 + ui-1,j+1), 


or equivalently (with r = k/h?) 


(2+2r)ui j+1 —1 Wi41, jH1 +4i-1,j41) = (2—2r)ui j +r(ui+1,j +ui—1,j). (6.38) 


If r = 1, Eq. (6.38) becomes simply 


4ui j+1 — Ui+1,j+1 — Ui—1,j+1 = Ui+l,j + Ui—l,j- (6.39) 


Example 6.2.3 (Heat conduction in an insulated bar). Consider a laterally insulated 
metal bar of unit length and « = 1. The ends of the bar are kept at a temperature of 
u = 0°C. At t = 0, the temperature distribution is given by the triangular distribution 
in Fig. 2.8 with L = 1 such that 


2x if0 <x <0.5, 


IO) 5 te ous. aps ee al 


eee 
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For h = 0.2, find the temperature in the bar for 0 < t < 0.08. 
Solution. 
(i) Crank—Nicolson method 


Taking r = 1, we have k = h? = 0.04 (see grid in Fig. 6.5). The initial values are 
u10 = 0.4 and u20 = 0.8. Also by symmetry, u30 = u20 and u40 = u10. In addition, 
u31 = U2) and u4; = u11. Using Eq. (6.39), for j = 0, 


| 
P P. 
0.08 E2 Pz j=? 
P- P. 
go Pu Pa j=j 
Pio Pao P30 P40 


— 
x 
FIGURE 6.5 
Mesh for the numerical solution of an insulated metal bar. 
(i=1): 
4uj, —u21 Mol = u2 + uoo = 0.8, 
eed =~ eed 
O(BC) 0.8 0(BC) 
(i=2): 
—uy, +4u2, — u31 =uio + u30 =0.44+0.8= 1.2. 
=~ eed 
u21 (sym.) u29(sym.) 
Therefore, 


4u] —u21 = 0.8, 


6.40 
—uy, +4u2) —u2, = 1.2. ( ) 


The solution of Eq. (6.40) is 


uj, = 0.327 and u21 = 0.509. 
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Similarly, for j = 1, 


G=1) 4uy2 ~un — ug2 =uo, +u21 =0+0.509 = 0.509, 
we 
O(BC) 
(G=2) —uy2+4u22 = u32 = u11 + u31 = 0.327 + 0.509 = 0.836. 
Ww nH 
u22 (symmetry) u21 
The system then becomes 
4u12 —u22 = 0.509091, 


(6.41) 
—u12 +4u22 —u22 = 0.836364, 


with the following solution for Eq. (6.41): 


u12 = 0.215 and u22 = 0.350. 


The temperature distribution in the bar is given in Table 6.6. The dashed vertical line 
in Table 6.6 indicates the line of symmetry. 


Table 6.6 Temperature in a thin bar using a Crank—Nicolson method. 


t x=0 x=0.2 x=04 | x=0.6 x=0.8 x=1 j 

0.00 0 0.4 0.8 | 0.8 0.4 0 0 

0.04 0 0.327 0.509 | 0.509 0.327 0 1 

0.08 0 0.215 0.350 | 0.350 0.215 0 2 
i=1 i=2 


(ii) Direct method 


Letting r = 0.25 (< 5) yields k = rh? = 0.25(.2)7 = 0.01. As such, this value re- 
quires four times as many time steps as the Crank—Nicolson method. The formula in 
Eq. (6.35) with r = 0.25 gives 


Ui j+1 = 0.25(uj-1, ;j + 2uij + uj+1,j)- (6.42) 


For j = 0, uoo = 0 (boundary condition), u10 = 0.4, u20 = u30 = 0.8 (by symmetry) 
so that 


(G=1) u1; =0.25(ugg + 2u10 + u20) = 0.4, 
(i=2) u2, = 0.25(uj0 + 2u20 + u30) = 0.25(u10 + 3u20) = 0.7. 


For j = 1 and noting that uo; = uo2 = ... = 0 from the boundary conditions, 


G@=1) u2 =0.25(uo1 +2u11 + u21) = 0.375, 
(§=2) un =0.25(u11 + 3u21) = 0.625. 
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and so on. As shown in Table 6.7, the accuracy is comparable to that of the Crank— 
Nicolson method. The exact solution is obtained from the general solution in Exam- 
ple 5.2.4 where L = x = 1. Thus, substituting 


2x if O<x <0.5, 


FOS] oa x) ifo5<x<1. 


8 
into Eq. (5.24) and integrating with Maple yields B, = ae sin (=). Hence, the 
nm 
analytical solution is given by Eq. (5.23) 


8B si 2 
u(x,t) = -2 > = ee ) sin(nrx)e 0, 
n=1 
Using Maple to sum the series to 10 terms yields the exact solution also shown to 
three decimal places. In addition, the COMSOL result using a finite element tech- 


nique with an error tolerance of 1 x 1076 in Example 7.1.3 is also shown in Table 6.7, 
which is in perfect agreement with the analytical solution. 


Table 6.7 Temperature in a thin bar using a direct method. 


t 1052) x=0.4 

CN Direct | Exact | COMSOL | CN Direct | Exact | COMSOL 
(0) 0.4 0.4 0.4 0.4 0.8 0.8 0.8 0.8 
0.04 | 0.327 | 0.313 | 0.319 | 0.319 0.509 | 0.508 | 0.521 | 0.521 
0.08 | 0.215 | 0.210 | 0.216 | 0.216 0.350 | 0.340 | 0.350 | 0.350 


CN - Crank-Nicolson method. 


6.2.3 Hyperbolic equations 


By an appropriate transformation, the wave equation and corresponding boundary 
conditions can be written as 


Utt = Uxx, O<x<l1l,t>0, 
u(x,0)= f(x initial displacement), 
&,0)= f(x) ( p i ) (6.43) 
u;(x, 0) = g(x) (initial velocity), 
u(0O,t) =u(1,t) =0 (boundary conditions). 


Using Eq. (6.16a) and Eq. (6.16b), a finite difference approximation of Eq. (6.43) is 


1 1 
Z (ui j+1 — 2uij + uij-1) = 72 (uisi,j — 2ui j +ui-1,;) 5 (6.44) 
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where A is the mesh size in the x direction and k in the mesh size in the ¢ direction. 

Similarly, solving for u;,;+1 (which corresponds to time row j + 1 in Fig. 6.6), where 
k2 

Pies, Co 


r 5 


Ui j+1 = Ui-1,j + Ui+1,j + Ui, j—1- (6.45) 


It can be shown for that for 0 < r* < 1, the present method is stable. 


x Time row j + 1 ° 
k k 
X * x Time row j x x 
h h h h 
k k 
x Time row j — 1 x 
(a) (b) 


FIGURE 6.6 
Mesh points used in Eq. (6.44) and Eq. (6.45). 


For the condition u;(x, 0) = g(x) in Eq. (6.43), one can derive the difference formula 


1 
J (uii — ui -1)=8(i) Or ui -—1= ui 1 — 2kgi, (6.46) 


where g; = g (ih). For t = 0 (that is, j = 0), Eq. (6.45) and Eq. (6.46) yield 


(ui-1,0 +ui+1,0)+kgi| G =0) (6.47) 


ui 1 = 


NI = 


(which expresses u; 1 in terms of the initial data). 


Example 6.2.4 (Vibrating string). Consider a string of unit length and c = 1. The 
grid is the same as in Fig. 6.5, where the string is fixed at both ends, and has the same 
2x if O<x <0.5, 
2(1—x) if O5<x <1, 
g(x) = 0. For h = k = 0.2, solve the wave equation for the deflection of a string 
u(x,t). This problem is analogous to Example 6.2.3. 


initial distribution: f(x) = with an initial velocity 


eee 
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Solution. 

The grid is the same as in Fig. 6.5, except for the t values which are now at 0.2, 0.4, ... 
(instead of 0.04, 0.08...). The initial values are the same as Example 6.2.3. From 
Eq. (6.47) with g(x) = 0, 


ui 1 = (ui-1,0 a ui+1,0) 


NI = 


so that 


1 1 
G@=1) u= z (ugo + u20) = z [0 + 0.8]=0.4, 


1 1 
G@=2) un= 5 (u10 + u30) = 5 [0.4 + 0.8] = 0.6. 
As in the previous example, by symmetry, u31 = u21 and u41 = u11. 


From Eq. (6.47) with j = 1 (where uo) = uo2 = ... = 0 from the boundary condi- 
tions): 


G=1) u2 = uoi + u21 — u10 = 04+ 0.6 — 0.4 = 0.2, 
(i =2) un =u]; + u31 — u20 = 0.4 + 0.6 — 0.8 = 0.2. 


In the computation for (i = 2), the symmetry u31 = u21 has been used. Also by 
symmetry, u32 = U22 and u42 = u12. Proceeding in this manner, the values in Ta- 
ble 6.8 are obtained. The line of symmetry is shown by the dashed line. An analyt- 
ical solution is derived in Example 5.2.3, where for this problem L = c? = 1 and 


g(x) = 0 (so that A, = 0 in Eq. (5.19)). Moreover, the same integral arises for By, 
8 

in Eq. (5.18), which is evaluated as B, = Gaye sin (=). Hence, from Eq. (5.17), 
ni 


8 si 2 
u(x,t) = > = sonra] sin(nzx) cos(nzt). Maple can be used to sum this 
m2 n2 


n=1 
series, where 100 terms are needed since the analytic series converges more slowly 
than the one in Example 6.2.3. The numerical values in Table 6.8 are in fact identical 


with the analytic solution as well as the COMSOL solution in Example 7.1.4. 


Table 6.8 Distribution of a vibrating string. 


t x=0 x=0.2 x=0.4 x=0.6 x=0.8 amil 
0.00 | 0 0.4 0.8 | 0.8 0.4 fo) 
0.2 fo) 0.4 0.6 0.6 0.4 f) 
0.4 fo) 0.2 0.2 | 0.2 0.2 fo) 
0.6 fo) -0.2 —0.2 —0.2 —0.2 fo) 
0.8 fo) —0.4 —0.6 —0.6 —0.4 fo) 
1.0 fo) -0.4 -08 |! -08 —0.4 fo) 
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Problems 
6.1 Explain the advantages and disadvantages of the Adam—Moulton, Runge-Kutta, 
and Euler methods for the numerical solution of first-order ordinary differential equa- 
tions. 

d 
6.2 Consider the first-order ordinary differential equation y’ = ef f(x, y). De- 


rive the predictor and corrector equations for the Heun method by using a truncated 
Taylor series for yYn+1. 


6.3 Given the initial value problem y’ = y — y?, y(0) = 0.5: 

(a) Solve the problem exactly using standard analytical methods for first-order 
ordinary differential equations. 

(b) Apply the Runge-Kutta method (of fourth order) and solve the problem for 
0 <x < 1, with a step size of h = 0.2. Compute the error, using the exact values in 
part (a). 


6.4 Using the Maple software package: 

(a) Consider the square plate in Example 6.2.1 for a mesh spacing of h = 4 cm 
(that is, h = a/n, where a = 12 cm and n = 3). However, the bottom and side bound- 
aries each have a temperature of 100°C and the top boundary is at 0°C. Therefore 
show that Uil = U21 = 87.5°C and Uui? = Un = 62.5°C. 

(b) Repeat the calculation for a mesh spacing of h = 3 cm (that is, n = 4) and find 
the steady-state temperature at the internal mesh points. Hint: The number of mesh 
points can be reduced if symmetry is considered. 

(c) The analytical solution is given by 


[ee] 


wa 400 y 1 
u(x, y) = — - 
Um Z nt sihn r 
| sin(Qn+1)mx . (2n+ 1)r(a-— y) 
x sinh 
a a 
4 sin(2n + l)my sinh (2n + 1)n(a — x) eee (2n + a] l. 
a a a 


Show that this analytical solution yields the values u1] = u21 = 88.1°C and u12 = 
u22 = 61.9°C for the given mesh in part (a). Compare the numerical results of part 
(b) with the values obtained from the analytical solution. 


6.5 Write the steady-state heat conduction equation (that is, the Laplace equation) 
for a Cartesian coordinate system for two spatial dimensions. During the winter (in 
which the air has an ambient temperature of —10°C), a thin car window with dimen- 
sions 1 m high by 1.5 m wide is completely covered with ice. A poor quality electric 
heater is only able to apply heat to the bottom edge of the window at 23°C. The other 
edges of the window remain at the ambient temperature of the air. Determine if one 


a 
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can see through the middle third of the bottom half of the window if one waits for 
a long enough time by performing a numerical solution of the steady-state heat con- 
duction equation with stated boundary conditions (that is, calculate the temperature 
at the mesh points u1; and u21 with a given mesh size of h = 0.5 m). 


6.6 Consider the temperature distribution u(x,t) in an insulated bar of length L 
2 


as governed by the heat conduction equation vi ea subject to the boundary 
t X 
.. ðu DE a 
conditions — =O0t>0,u(L,t) =0, t > 0, and initial condition u(x, 0) = uo, 
X |y=0 


O0<x<L,t=0. 


(a) Consider the transformation of the above parabolic heat conduction equation. 
u 


: : ; i x Kt, 
Using the nondimensional variables v = —, n = T’ and t = TZ give the trans- 
u 


formed partial differential equation for v (n, 7) with the corresponding boundary and 
initial conditions. 


(b) Consider the direct method for the numerical solution of the parabolic partial 
differential equation. Show that with k = 1 and L = 1, with a reflexive boundary 
condition at i = 0, the following formula results: uo, j+1 = (1 — 2r)uo,j + 2ruy4,;, 
where r = k/h?. 


6.7 Explain what methodology could be used to determine if a numerical solution 
for a given problem is correct and accurate. 


6.8 Consider the heat conduction in a thin thermally insulated bar of length L which 
is initially at a temperature of 0°C. At t > 0, heat is applied to the ends of the bar, 
which are kept at a constant temperature of 55°C. The temperature distribution is 
32 
therefore described by the partial differential equation: F =K -a with boundary 
x 
conditions u(0, f) = u(L, f) = 55, f > 0, and initial conditions u(x,0) = 0, for 0 < 
X¥<L,t=0. 
(a) What transformation can be applied to the independent variables in order to 
32 
reduce the problem to a < or Ut = Uxx With the corresponding boundary and 
initial conditions u (0, t) = u (1, t) = 55, t > 0, and u(x, 0) = 0, for 0 < x < 1,t = 0? 
(b) Using a Crank—Nicolson method, solve the two resulting finite difference equa- 
tions for the first time step for the unknowns u1; and u21 with r = k/ h? = 1 and 
h = 0.2. What is the advantage of the Crank—Nicolson method over the conventional 
direct method? 


(c) By transforming the dependent variable and using the more general solution in 
Example 5.2.4, give the analytical solution for the problem in part (a). 


CHAPTER 


Finite element methods 


Finite elements provide an alternative numerical technique in contrast to finite differ- 
ence techniques for the solution of ordinary and partial differential equations. 

COMSOL MULTIPHYSICS (COMSOL Inc., 2018) is a commercial software 
package that can be used to numerically solve real-world ordinary and partial dif- 
ferential equations arising in various engineering applications. The Finite Element 
Method (FEM) or equivalently Finite Element Analysis (FEA) is widely utilized in 
well known engineering simulation packages such as ANSYS, ABAQUS, LS-DYNA, 
NASTRAN and a variety of free and open source tools. This tool is particularly use- 
ful if the problems are nonlinear where the coefficients of the differential equation 
may be a function of the independent variable (such as for the thermal conductivity 
depending on temperature in a heat conduction problem). 

The application of this tool is detailed in Section 7.1. All numerical problems 
presented in Chapter 6 are solved again. In this discussion, the use of the tool is 
demonstrated with an annotation of the graphical user interface for solution of a given 
problem in a step-by-step manner. This description therefore provides the procedure 
to solve different types of problems for both ordinary and partial differential equa- 
tions. This is an extremely important software tool that an engineer would use for 
a real-world problem rather than requiring the writing of their own computer pro- 
gram for the numerical solution. Thus, COMSOL can be used to solve very complex 
and coupled ordinary and partial differential equations as “multiphysics” type anal- 
ysis. This terminology means that one can couple all types of physical phenomena 
together, including, for example, heat and mass transfer, chemical reactions, material 
structural effects of stress and strain, and thermophysical properties of materials that 
may depend on independent solution variables as a single problem. A more complex 
multiphysics problem in given as Problem 7.7. 

The general theory of finite elements, which underscores the development of 
COMSOL, is also detailed in Section 7.2. This discussion involves the covering of a 
domain with finite elements of various dimensions as well as the development of lin- 
ear and higher-order shape functions. This technique approximates the solution to the 
differential equation with appropriate weightings and numerical techniques to mini- 
mize the error (for example, the Galerkin method). A sample finite element problem 
is worked through by hand in Problems 7.8 to 7.18. 
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7.1 COMSOL application methodology 


The COMSOL methodology specifically involves the following general steps to solve 
a given problem, with a selection of 


e model wizard (or an import of a previous simulation); 

e space dimension (dimensional representation and symmetry of the problem); 

e type of physics to be solved (for example, heat conduction or wave equation); 

e type of solver: stationary (steady-state) or time-dependent problem; 

e model builder options and solver, including: (i) geometry/physical object(s) being 
simulated, (ii) coupled multiphysics, (iii) material properties, and (iv) boundary 
and initial conditions; 

e finite element mesh size for the object; 

e problem solution and output/process of results. 


7.1.1 Solved problems with COMSOL 


The step-by-step methodology for solution with COMSOL 5.4 of various problem 
examples are described below. Problems taken from Chapter 6 are identified in each 
of the following examples. 


Partial differential equations 

Example 7.1.1 (see Example 6.2.1). As shown in Fig. 6.1(a), four sides of a square 
plate (12 cm x 12 cm) are kept at constant temperature 0°C and 100°C. Find the 
steady-state temperature for this Laplace heat conduction problem with Dirichlet 
boundary conditions with COMSOL. 

Solution. Select the following choices. 


Step 1: start model application. 
New: click “Model Wizard” (see Fig. 7.1). 


New 


Ej <+—_ 


Model Wizard 


mph 
Blank Model 


FIGURE 7.1 
Start of the COMSOL 5.4 software tool. 


Step 2: space dimension. 

Space dimension: click “2D” (see Fig. 7.2) since the plate can be considered two- 
dimensional if the temperature doesn’t change significantly through the thickness of 
the plate. 


ee eee: Eee 
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Select Space Dimension 


aleie)/ei—| - 
30 en 20 18 1D oD 


Axisymmetric Axisymmetric 


FIGURE 7.2 


Selection of a space dimension. 


Select Physics Review Physics Interface 


Search Heat Transfer in Solids (ht) 


4 © Recently Used 
\® Heat Transfer in Solids (ht) 
¥ aoc scone: f 
) Acoustics 


Dependent Variables 


f Chemical Species Transport 
Fluid Flow 
4 l| Heat Transfer 
\@ Heat Transfer in Solids (ht) + 
i= Heat Transfer in Fluids (ht) 
læ Heat Transfer in Solids and Fluids (ht) 
{= Conjugate Heat Transfer 
D xil Electromagnetic Heating 
{@ Heat Transfer in Porous Media (ht) 
& Structural Mechanics 
u Mathematics 


Add 4 


Added physics interfaces: 
\® Heat Transfer in Solids (ht) 


Remove 


Q Space Dimension © Study<j— 
Help (x} Cancel [V Done 
FIGURE 7.3 


Selection of the physics for a heat transfer analysis. 


Step 3: physics. 

Physics: “Heat Transfer in Solids (ht)” — click “Add” — click “Study” (see Fig. 7.3). 
The dependent variable is the temperature T and the independent variables are the 
positions x and y. 


eee 
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Step 4: study. 
Study: click “Stationary” — click “M Done” (see Fig. 7.4). 


Select Study 
4 ~ General Studies 
Stationary g— 
^, Time Dependent 
~> Empty Study 
Added study: 


Added physics interfaces: 
\® Heat Transfer in Solids (ht) 


Q Prysics 
E Her 3 conl [X Done <q— 


FIGURE 7.4 


Selection of the time dependence for the heat transfer model. A time-independent 
(stationary) model is chosen. 


Step 5: model builder. 

The following Model Builder page is generated; choose the “Length unit” as cm (see 
Fig. 7.5). During the creation of an object (especially a complex one), COMSOL 
automatically repairs any generated geometry by removing small edges and faces 
through the “Default repair tolerance” selection. The default value is relative to the 
overall size of the geometry but can be edited to a smaller tolerance value as required 


ee = O 
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®\Rmeuwr b REA” 
Home Definitions Geometry Materials Physics Mesh Study Results Developer 
SIN 3 » Variables ~ = Import 
A @| P : R je g 
foo Functions + 
Application Component Parameters Build Heat Transfer | Ad 
Builder ” . All in Solids + |Phy 
Model Builder Settings Properties = 
dead th By Geometry 
4 @ Untitled.mph (root) @ Build All 
4 © Global Definitions 
Parameters 1 Label: Geometry 1 =] 
it Materials Uni 
v 
49g Component 1 (compi) nits 
= Definitions C Scale values when changing units 
A 
7A Geometry 1 Length unit: 
"i Materials 
am X 
4 \®@ Heat Transfer in Solids (ht) 
&® Solid 1 Angular unit: 
& Initial Values 1 Degrees Z 
25 Thermal Insulation 1 
A Mesh 1 v Advanced 
& Results Default repair tolerance: 
Automatic z 
[Z] Automatic rebuild 
FIGURE 7.5 
Model builder page and selection of the unit length of the square plate for the heat transfer 


analysis. 


if one needs extra fine detail. For the simple plate in this example, an “Automatic” 
choice is selected. 


Using this page perform the following actions. Right click “Geometry 1” (choose 
“Square L]” in the drop down menu — in the resulting figure enter “12” cm for the 
“Side length” and select “Build All Objects”) (see Fig. 7.6). 


Right click “Materials” (choose “Blank Material” — thermal conductivity value for 
k of “1” (see Fig. 7.7). As shown for the Laplace equation in Eq. (5.25) for heat con- 
duction in a plate, this problem is independent of the value of k but a value needs to be 
entered or an error will occur while the computation is performed in Step 7). This is- 
sue can be avoided by selecting “Mathematics” —~> “Classical PDEs” —> “Laplace 
equation” for the Physics option in Step 3 (see, for example, Example 7.1.2), where 
a material entry is no longer required. 


Right click: “Heat Transfer in Solids (ht)” and select “Temperature” (do this four 
times to add the four temperature subheadings under “Heat Transfer in Solids (ht)”: 
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Home 


Definitions Geometry Materials Physics Mesh Study Results Developer 
A D Pi 1 Variables ~ B = Import f 4 
(0 Functions + 
Application Component Parameters Build Heat Transfer | Ac 
Builder ’ , All in Solids ~ [Ph 
Model Builder Settings Properties vs 
ev itl H = 
= = m y Square 
a z 
4 @ Untitled.mph (root) ® Build Selected ¥ @ Build All Objects È 
4 © Global Definitions 
Parameters 1 Label: Square 1 i} 
tit Materials eae 
v 
4 @ Component 1 (comp?) eyes 
= Definitions Type: | Solid z 
4 A Geometry 1 
go Square 1 (sq1) v Size 
* Form Union (fin) Š 
2, i Si HL 
š: Materials Side kength am 
4 \® Heat Transfer in Solids (ht) v Position 
&@ Solid 1 
%@ Initial Values 1 Base: Corner X 
85 Thermal Insulation 1 
x 0 cm 
Â Mesh 1 
> & Results y: a an 
v Rotation Angle 
Rotation: 0 deg 
Layers 
v Selections of Resulting Entities 
[C Resulting objects selection 
in physic Domain selectior 
None X 
FIGURE 7.6 


Selection of the size of the square plate for the heat transfer analysis. 


“Temperature 1,” “Temperature 2,” “Temperature 3,” “Temperature 4”) (see Fig. 7.8). 
Click on “Temperature 1,” click on one side of the square in the picture, and then 
enter one of the four boundary condition values in the box under ““Temperature:” 
as a “User defined” value. Repeat this step for the other three temperature headings 
to assign the four Dirichlet boundary conditions on each side of the square. Note 
that calculations are performed in SI units with temperatures in K, but the boundary 
temperatures can be input instead in degrees Celsius as 0 [degC] and 100 [degC], 
accordingly. 


Step 6: create a finite element mesh for the object. 
Click on “Mesh 1,” select the “Element size:” of the mesh (such as “Normal” or, 
if desired, the mesh can be finer for better accuracy) and click on “Build All.” A 
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FIGURE 7.7 


Selection of the material properties of the square plate for the heat transfer analysis. 
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FIGURE 7.8 


Selection of boundary conditions for the heat transfer analysis for the square plate. 
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“Physics-controlled mesh” is automatically generated of triangular shape as shown 
in the picture on the right-hand side of the page (see Fig. 7.9). 
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FIGURE 7.9 


Meshing of the square plate. 


Step 7: solve the problem and output results. 

Finally, the problem can be solved by clicking on the “= Compute” at the top banner 
or by right clicking “Study 1” and choosing “Compute” at the top of the pop-up 
box. 

The results are shown as a color scale plot on the right-hand side by clicking 
the “Temperature (ht)? entry (see Fig. 7.10). By clicking on a specific point in 
the color plot one can obtain the temperature T in K under the plot in a ta- 
ble for the given position of “X:” and “Y:” that is clicked. The results can be 
given in degrees Celsius by selecting “Surface” under the “Temperature (ht)” en- 
try and changing the “Unit:” to “degC” in the drop down box and clicking the 
“Plot” button under the “Surface” heading at the top on the right-hand side of the 


page. 


However, this method is more difficult when one wishes to obtain a value of the tem- 
perature at an exact position of interest on the plot. Thus, to obtain a temperature at 
a precise coordinate location: 

(i) Right click “Data Sets” and choose “Cut Point 2D L]? A new item is created 


as “LI Cut Point 2D 1.” Enter the coordinates of the desired point, for example: 
“X:” 4 cm and “Y:” 4 cm for evaluation of the temperature at this location (see 
Fig. 7.11). 


(ii) Right click “Results” and click “Evaluation Group,” resulting in a new entry in 
the menu along the left-hand side: “Evaluation Group 1.” Right click on “Evalua- 
tion Group 1” and choose “Point Evaluation,” which results in another new entry 
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FIGURE 7.11 


Selection of the coordinate location on the square plate for evaluation of the temperature. 
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in the menu along the left-hand side of “Point Evaluation 1.” Click on “Evalua- 
tion Group 1” and select “Cut Point 2D 1” in the selection box for “Data set:.” 
Click on the “Point Evaluation 1” item and in the box for “Data set:” add under 
the “Expression” heading: “T(K)-273.15,” delete the “K” symbol under the “Unit” 
heading, and add “[degC]” under the “Description” heading (see Fig. 7.12). This 
relabeling provides values in degrees Celsius (vice K). By selecting “Evaluate” on 
the “Point Evaluation 1” page, the result at position (4,4) is shown at the bottom 
of the table under the figure as 11.928 [degC]. Likewise, the temperature at po- 
sition (4, 8) is 38.072 [degC]. This latter value is obtained again by clicking on 
the “Cut Point 2D 1” item in step (i) and choosing coordinates (4, 8) for “X:” and 
“Y: These two results are in agreement with the analytical Fourier series solution. 
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FIGURE 7.12 


Evaluation of the temperature at a particular coordinate location on the square plate (in °C). 


Example 7.1.2 (see Example 6.2.2). Similarly, the Poisson equation 
V-u = 16xy (7.1) 


can be solved with COMSOL for the boundary value problem involving both Dirich- 
let and Neumann conditions, as shown in Fig. 6.2(a). 


Solution. The same methodology as used in Example 7.1.1 is applied, except that the 
physics, geometry, and boundary conditions are different. 
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In Step 3, “Poisson’s equation” is chosen under “Classical PDEs” in place of the 
physics model “Heat Transfer in Solids (ht)” (see Fig. 7.13). 
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FIGURE 7.13 


Selection of the physics for solution of a Poisson equation. 


In Step 5 a “Rectangle” is chosen for “Geometry 1” (with a “Width:” 1.5 and 
“Height:” 1.0) (see Fig. 7.14). 


Right click “Poisson’s equation (poeq)” and select “Dirichlet Boundary Condi- 
tion” (three times) and “Flux Source” for the four mixed boundary conditions (see 
Fig. 7.15). Click on “Dirichlet Boundary Condition 1” and “Dirichlet Boundary Con- 
dition 2” and enter a “0” in the box for each of these “r” entries under the “Dirichlet 
Boundary Condition” heading after clicking on each of the two sides of the figure. 
However, for the more complicated “Dirichlet Boundary Condition 3,” click on the 
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far side of the square in the picture and then enter “4*(y*2+1)” for “r” (see bottom 
pane of Fig. 7.15). Finally, for the last boundary with the Neumann condition, click 
on “Flux/Source 1” and enter “2*(x+1)” in the box for “g” after clicking on the top 
boundary in the figure (see top pane of Fig. 7.15). 


Click on the “Poisson Equation 1” and enter “-16*x*y” for the parameter “f”? to com- 
plete the form of Eq. (7.1). Use an “Element size:” of “Extra fine” for “Mesh 1” to 
obtain higher accuracy. 


The COMSOL solution gives u(0.5, 0.5) = 0.507, u(1, 0.5) = 1.717, u(0.5, 1) = 
1.466, and u(1, 1) = 3.269 (see Fig. 7.16). The value of the temperature at x = 0.5 
and y = 0.5 is shown in the table under the graphic in the top pane of Fig. 7.16, and 
the value at x = 1.0 and y = 1.0 is further shown in the bottom table under the graphic 
in the bottom pane of Fig. 7.16. For this problem with more complicated boundary 
conditions, the numerical solution of Example 6.2.2 with a course grid spacing is in 
reasonable agreement with the accurate solution provided by COMSOL. 


Example 7.1.3 (see Example 6.2.3). Similarly, the parabolic equation 
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Entry of boundary conditions for the rectangle for solution of the Poisson equation. 


can be solved for the time-dependent heat conduction problem for an insulated metal 
bar of unit length, where the ends of the bar are kept at a temperature of 0°C. An 


initial temperature distribution is applied: 


2x 
f@)= 


2(1—x) 


if O <x <0.5, 
if 05<x <1. 


Solution. The same methodology can be used as in the previous examples. The 
changes to the physics, geometry, and boundary conditions are detailed below. A 
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FIGURE 7.16 


Solution of the Poisson equation. 
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FIGURE 7.17 


Selection of the heat equation in COMSOL for the time-dependent heat conduction in a thin 
metal bar. 


one-dimensional model is chosen for the thin metal bar. In the model setup, “Heat 
equation(hteq)” is chosen for the physics under “Mathematics —> “Classical PDEs” 
(see Fig. 7.17). 


Under “Geometry 1,” “Interval” is selected, resulting in a menu item of “Interval 1,” 
where the “Coordinates” 0 and 1 can be input representing the bar of unit length. 
By right clicking “V? Heat Equation (hteq)” in the left-side menu, the “Dirichlet 
Boundary Condition 1” and “Dirichlet Boundary Condition 2” items can be created 
in order to apply the boundary conditions of zero at each end of the bar. Eq. (7.2) 
can be input by clicking on “Heat Equation 1” and choosing c = 1, f = 0, and 
da = 1 (see Fig. 7.18). The initial value of (x < 0.5) * (2 x x) + (x > 0.5) * (1 — x) 
can be input into the box for the “Initial value for u” on clicking the “Initial Val- 
ues 1” menu item (see Fig. 7.19). Here the inequalities (x < 0.5) and (x > 0.5) 
are interpreted by COMSOL as conditional statements which evaluate as 1 if the 
inequality is true, and O if the inequality is false. The solid is then meshed and 
built. 


The needed accuracy for the problem requires a suitable mesh size and time step. 
The object can be meshed with a finer mesh size by clicking on the “Mesh 1” menu 
item and selecting the “Element size.” The tolerance for the time step can be entered 
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FIGURE 7.18 


Input of the heat equation physics for conduction of heat in the thin metal bar. 
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FIGURE 7.19 


Input of the initial condition for the problem. 


in the “Step 1: Time Dependent” menu item (see Fig. 7.20), and choosing an input 
value in the box for the “User Controlled” “Relative tolerance” (that is, 1E—6 in this 
example). The output time range (that is, the time at a reported step and the total time 
of the problem simulation) is also input by the user on this page. 
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FIGURE 7.20 


Selection of time-dependent range and relative tolerance. 


A solution can then be computed with a selection of the “= Compute” button after 
right clicking the “Study 1” menu item. 


For comparison to the values in Table 6.7, an output step size of 0.04 s is chosen 
with a total simulation time of 0.08 s. To output the results at a specific x location, 
right click “Results” and choose “Evaluation Group” in the drop-down box, which 
creates the submenu item “Evaluation Group 1.” Right click “Data Sets” and select 
“Cut Point 1 D.” A new submenu item “Cut Point 1 D 1” is created, where one can 
enter the value of the x coordinate of interest. Click the new “Evaluation Group 1” 
item and select “Data Set:” “Cut Point 1 D 1” and then right click “Evaluation Group 
1” and choose “Point Evaluation.” Click on the new created “Point Evaluation 1” and 
the “Evaluate” button. Click on the “1 D Plot Group 1” menu item, thereby display- 
ing the solution at different time steps and as a function of position along the bar (see 
Fig. 7.21). The initial distribution at t = 0 is shown in Fig. 7.21. At the bottom of the 
figure, values are also shown at the specified position as chosen (that is, at 0.2 m). 
The two values shown in the bottom table match the exact values in Table 6.7 for 
x = 0.2 for times of 0.04 and 0.08 s. 


Example 7.1.4 (see Example 6.2.4). For this example, a hyperbolic equation is ap- 
plied for a vibrating string of unit length, where the ends of the string are fixed. We 
have 


dtu _ dtu (7.3) 
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zero initial velocity. 


Solution. Once again, the same methodology can be used as in the previous exam- 
ples. A one-dimensional model is again chosen for the string with the same boundary 
conditions as in Example 7.1.3. Instead, for this model, “Wave Equation (waeq)” is 
chosen for the physics under “Mathematics —~> “Classical PDEs.” The same geom- 
etry as in the last example is used as well as the Dirichlet boundary conditions. 


Eq. (7.3) can be input by clicking on “Wave Equation 1” and choosing c= 1, f = 0, 
and da = 1 (see Fig. 7.22). The initial value of (x < 0.5)*2*x+(x > 0.5)*2*(1— x) 
is input for the “Initial value for u” on clicking the “Initial Values 1” menu item. The 
string is then meshed and built. 
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FIGURE 7.22 
Input of the physics for the vibrating string. 


A “Relative tolerance” of 1E—6 can be chosen in the “Step 1: Time Dependent” menu 
item. An output time range with a step size of 0.2 s and total time of 1 s is chosen in 
accordance with Table 6.8. A solution can now be computed with the “= Compute” 
button after right clicking the “Study 1” menu item. 


Output results are shown for the string as a function of distance along the string and 
as a function of time (see Fig. 7.23). Also shown are values of the string displacement 
at the location x = 0.4 m for times of 0.2 s and 0.4 s in the bottom table below the 
output figure. The solutions match the exact solutions in Table 6.8. 


Ordinary differential equations 

COMSOL is a powerful tool typically used to solve complicated partial differential 
equations, particularly when several equations are coupled when they represent an 
underlying multiphysics nature (such as both heat and mass transfer). For example, 
see the multiphysics question in Problem 7.7. However, COMSOL can also be used 
to solve ordinary differential equations of first and second order, (or higher order 
decomposed into multiple coupled lower-order ordinary differential equations), as 
also detailed in Chapter 6. Again, this section draws on the examples presented in 
Chapter 6 for these types of problems. 
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Output solution for the vibrating string with results also shown at x = 0.4 m for several times. 


Example 7.1.5 (see Example 6.1.1 (Euler’s method)). The same problem is 
also solved with different solution methods of increasing accuracy, including Ex- 
ample 6.1.2 (Heun’s method), Example 6.1.3 (Runge—Kutta’s method), and Ex- 
ample 6.1.4 (Adams—Moulton’s method). This example demonstrates the solu- 
tion method with COMSOL for a first-order ordinary differential equation. We 
have 


d / 
= =xtytl y(0)=0 for0<x <2. (7.4) 
x 

The COMSOL solution can be compared to the most accurate values based on 


Adams—Moulton’s method in Table 6.4. 


Solution. The same methodology can be used as in the previous examples. A one- 
dimensional geometry is chosen with the “Interval” option selected by right clicking 
the “Geometry 1” menu entry, thereby creating a submenu item “Interval 1 (i1).” In 
this model, a “Coefficient Form PDE(c)” is chosen for the physics under “Mathe- 
matics —> “PDE Interfaces.” A “Dirichlet Boundary Condition” is selected on right 


ee O O o 


7.1 COMSOL application methodology 209 


YS Settings Properties Graphics 


aaa Ax i 
Gago aa 
Label: Dirichlet Boundary Condition 1 — e 5 
B 
¥ Boundary Selection 
Selection: Manual 2 
a 3 
Override and Contribution 
Equation — - 
¥ Dirichlet Boundary Condition 
[Z Prescribed value of y 
Dat ro 4«— 1 
m De 
E Tables 
1D Plot Group 1 
@ Evaluation Group 1 
© Export 
IE) Reports 
‘iz v Diiin. 0. 5 EA ji TE 7 5 Coes 2 
FIGURE 7.24 
Input of the Dirichlet boundary condition for the first-order ordinary differential equation in 
Eq. (7.4). 


clicking the “Coefficient Form PDE(c).” Similarly, as in previous examples, a new 
submenu “Dirichlet Boundary Condition 1” is created, from which one assigns “r:” 
equal to O in the box for the “Prescribed value of r” to apply the boundary condition 


y(0) = 0 at the point x = 0 on the interval (see Fig. 7.24). 


Eq. (7.4) can be input by clicking on “Coefficient Form PDE 1” and choosing c = 1, 
a=-l, f =x + l, ea = 0, da =0,a = 0, $ = 1, and y = 0 (see Fig. 7.25). The ob- 
ject is then meshed and built. A solution can now be computed with the “= Compute” 
button after right clicking the “Study 1” menu item. 


Output results for y(x) are shown in Fig. 7.26. Also shown is the specific value of 
y at x = 2.0 in the bottom table below the output figure using the same method as 
before for selecting (cutting) a specific one-dimensional point. The solution matches 
the exact solution in Table 6.4. 


Example 7.1.6 (see Examples 6.1.5 (Runge-Kutta—-Nyström’s method)). This 
example demonstrates the solution method with COMSOL for the following second- 
order ordinary differential equation: 


d2 
dx 


< 


N 


1 dy dy 
=-(x+y+—+1 y0)=0, —(QO)=0 for O<x<1. (75) 
2 dx dx 


The COMSOL solution can be compared to the exact values in Table 6.5. 


Solution. The same methodology can be used as in Example 7.1.5, except that the 
problem can be recast as a one-dimensional “Time Dependent” study (in contrast to 
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Output solution for y(x) for Eq. (7.4) with results also shown at x = 2.0. 
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Input of the physics for the second-order ordinary differential equation in Eq. (7.5). 


a “Stationary” study as in the previous example). In this problem, one can simply 
replace x by ¢ in the ordinary differential equation to better capture the two condi- 
tions in Eq. (7.5) as initial conditions: y(t = 0) = 0 and Da = 0) = 0. Similarly, 
a “Coefficient Form PDE(c)” is chosen for the physics under “Mathematics —> 
“PDE Interfaces.” However, no Dirichlet boundary conditions need to be considered 
in this problem. Although a one-dimensional geometry is chosen again with “Interval 
1 (i1); this assignment is somewhat arbitrary as there is no relevant spatial dimension 
in this problem so that default values can be assumed for the “Coordinates.” Instead, 
the initial conditions are captured in the “Initial Values 1” submenu that is created on 
right clicking “Coefficient Form PDE(c)” and selecting the option “Initial Values.” 


Eq. (7.5) can now be input by clicking on “Coefficient Form PDE 1” and choosing 
c=0,a=0, f =0.5(t + y+ 1), eg = l, da = —0.5, a = 0, B = 0, and y =0 
(see Fig. 7.27). A step size of 0.2 and a range from 0 to 1 can be input in the Step 1: 
“Time Dependent” submenu with a “User controlled” input for a “Relative tolerance” 
of 1E—6. The object is then meshed and built. A solution can now be computed with 
the “= Compute” button after right clicking the “Study 1” menu item. 


eee 
212 CHAPTER 7 Finite element methods 


Mo Settings Properties Graphics Convergence Plot 1 
$ eee he G aaa @ 
4@E D_2nd order_new.mpt = Evaluate 
= Evaluate 
4 @ Global Definitons Line Graph: Dependent variable y (1) ad 
Parameters 1 Label: Evaluation Group 1 E 
s$: Materials 0.4 
- 
4 Component 1 (comp?) pa 
Definitions Data set Cut Point 1D 1 -) Š oss 
Geometry 1 
© Moterels Time selection: | All - 
a r 
¥ Storage È 0.3 
Store table: In model Sar 
> 0.25 
Maximum number of rows: 10000 $ 
2 
Transformation $ 02 
F H 
rmat 
1E Data Sets SE ? 
tN Derived Values $ os 
H Tabh o 
4 1D PlotGroup 1 4— 
0.1 
Line Graph 1 
4 © Evaluation Group 1 
t Evaluation 14{— east 


® Export 
Reports 
° 
o 0.2 0.4 0.6 0.8 1 
'x-coordinate (m) 
Messages Progres; Evaluation Group 1 X 
a HBs s/=/Me 2 88S m- 
Time (s) Dependent vgriable J (1), Point: 0 
90000 0.23682 
0.40784 


Output solution for y with results also shown at several time steps. 


Output results for y as a function of x yield a different flat line at each time step since 
the problem has been recast as a time-dependent one (see Fig. 7.28). The value of 
y is also shown at several time steps in the bottom table. Using the same method as 
before for selecting (cutting) a specific one-dimensional point, a table is generated of 
y versus t. Here one can choose any arbitrary value of x within the problem range 
(since the line is flat) to capture the solution. 


The solution in Fig. 7.28 matches the exact values shown in Table 6.5 (where the time 
step value is recognized as a value of x»). 


7.2 General theory of finite elements 


The finite element method, or equivalently finite element analysis, is a set of nu- 
merical techniques combined to approximate solutions to differential equations on 
arbitrary domains. It is similar to the finite difference method encountered in Sec- 
tion 6.2, with the primary advantage being easier handling of complex geometries 
and variable spatial resolution. At a high level, the finite element method obtains a 
piecewise approximation to the solution of a differential equation with a controllable 
level of accuracy. 


lL O 


7.2 General theory of finite elements 213 


First, the discretization is achieved by dividing the domain into a finite number 
of smaller, geometrically simple subdomains called elements, from which the tech- 
nique derives its name. The individual elements together are referred to as the finite 
element “mesh” and the process of subdividing the domain is called “meshing.” The 
solution is approximated as the sum of polynomial basis functions with unknown 
coefficients in each element. The unknown coefficients, called degrees of freedom, 
are determined such that they minimize the residual (a measure of error) when the 
approximate solution is utilized in the differential equation. This process converts the 
differential equation into a multidimensional minimization problem. 

This minimum of the error estimate is found by differentiating the error with re- 
spect to each unknown leading to a set of nonlinear equations which can be solved 
numerically by techniques such as a damped Newton’s method described in Sec- 
tion 9.1.2, resulting in a series of linear equations. The optimal combination of the 
unknown coefficients which minimizes the error over the whole domain is known as 
the solution vector. Decreasing the size of the elements and/or increasing the number 
of basis functions in each element increases the number of degrees of freedom, such 
that the approximate solution becomes closer to the true solution. This approach also 
increases the computational costs since more unknowns must be determined. 

In the following sections this process is described in greater detail. Many finite 
element packages are available commercially (such as COMSOL MULTIPHYSICS, 
demonstrated in Section 7.1) or as open source software. These packages provide 
tools to perform these steps with varying levels of user input. Typically the user de- 
fines the model in the form of: (1) the relevant physics or mathematical equations, 
(2) the geometry and finite element mesh, (3) boundary conditions, and (4) solution 
techniques and tolerances for solving the nonlinear and linear systems of equations. 
The user is also required to provide judgment to ensure that the approximations at 
each step are sufficiently accurate for the intended application. 


7.2.1 Finite elements and shape functions 


This section introduces the concept of finite elements and their corresponding shape 
functions which can be utilized to approximate the solution to partial differential 
equations. The process for creating elements of arbitrary order is demonstrated in 
one dimension, which is then extended to higher dimensions. In Section 7.2.2 these 
elements are used to discretize differential equations and obtain solutions. 


One dimension 

Linear elements 

Suppose we wish to approximate a solution on a one-dimensional domain as a con- 
tinuous piecewise linear function. For linear elements the approximate solution & in 
each element is & = ag + a,x, where ao and a; are the unknown coefficients defined 
independently in each element. We define nodes at both ends of each element, num- 
bering the left node in each element as | and the right node as 2. We can rewrite u in 
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terms of the value of u at these nodes (that is, %1 = u (x1) and uy = u (x2)) as 


uy a2 1 X1 ag 
k-i all) 09 


which can be solved for the unknown coefficients 


e | 2 M 
do}_ {1 x a) 1 x2 —x1 1) [č 
H 7 fi A H X2 X] E 1 | H i (7.7) 


Using these values of aọ and a; the approximate solution can be written as 


xu, —xju2 —ü + U2 
Xx. 


ü= (7.8) 
X2 — X1 X2 — X1 
Collecting ŭı and i2 yields 
~ %27. =x +x 3 = 
ü = üi + ü = Nı (x)uy + N2 (x)ü2. (7.9) 
X2 — X] x2 — X1 


The functions N\(x) and N2(x) are known as shape functions, interpolation func- 
tions, or basis functions. The approximate solution is written as a sum of the product 
of the basis functions times the nodal values (that is, the solution is a linear combina- 
tion of basis functions). The method of determining the nodal values for a particular 
problem is discussed in the next section. The Nj (x) shape function decreases linearly 
from Nı (x1) = 1 to Nj (x2) = 0 across the element, while the N2(x) shape function 
increases from N2(x1) = 0 to N2(x2) = 1. Outside the element in which they are 
defined, their value is zero. 

The shape functions are defined independently on each element, but neighbor- 
ing elements can share the node between them (the right node of the left element 
is the left node of the right element), resulting in a continuous solution across ele- 
ments. The linear basis functions corresponding to a hypothetical finite element mesh 
with elements with ranges [0, 1.5], [1.5, 3], and [3, 4] are shown in Fig. 7.29(a). In 
Fig. 7.29(b) these elements and additional elements with ranges [4, 4.5], [4.5, 4.75], 
[4.75, 4.875], and [4.875, 5] are used to approximate y = sin(x) by interpolation. 

Note that the error in the approximation is nonuniform. The error is lowest when 
the function is approximately linear on the scale of the element. If the function devi- 
ates significantly from linear on the scale of the element, the approximation cannot 
reproduce the nonlinearity. More smaller elements produce less error compared to 
fewer larger elements. The approximation is continuous, but the derivatives at the 
nodes are not continuous. This is referred to as c? continuity, the zero here referring 
to the number of continuous derivatives. Although not covered in this work, higher- 
order continuity can be achieved with more advanced finite elements such as Hermite, 
but these elements come at additional computational cost and are usually reserved for 
special applications. 
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(a) One-dimensional linear shape functions for element number 1 [0, 1.5] (solid), element 
number 2 [1.5, 3] (dashed), and element number 3 [3, 4] (dash-dot). (b) y = sin(x) (thick 
solid line) and a linear finite element approximation (thick dotted line) along with the 
contribution from individual shape functions for a nonuniform finite element mesh (thin 
solid lines). 


Quadratic elements 


Suppose now one wants to approximate a solution on a one-dimensional domain 
as a continuous piecewise quadratic function. The above process is repeated with 
ii = ag + ayx + ax? with the additional unknown coefficient a. Since there are now 
three unknown coefficients we position three nodes in each element, one at each end 
and one located inside the element. Numbering these nodes from left to right, the 
unknown coefficients in terms of the values at the nodes are 


-1 


ao 1 x A ity 
a}=]1 x X5 üz |. (7.10) 
az 1 x3 x3 ü3 


As before, one: (1) calculates the inverse matrix to obtain the unknown coefficients, 
(2) substitutes the unknown coefficients into the polynomial form, and (3) collects 
nodal values (%1, #2, and &3) to obtain 


ü = Ni (x)üı + N2(x)ü2 + N2(x)ü2 (7.11) 
in terms of the quadratic shape functions 


X2X3 (x3 — X2) + (x2 — x3) x + (x3 — x2) x? 


M(x) = f 
—x?x2 + xix? + x2x3 _ x3x3 a xix? + x2x3 


(7.12a) 


x13 (x1 — x3) + (x3 — x?) x + @1 — x3) x? 


Nox) = — 5 2 fod 2 2 , 
Xp X2 + x1xX3 + XX3 — X5X3 — X1X3 + X2X3 


(7.12b) 
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(a) One-dimensional quadratic shape functions for element number 1 [0, 1.5] (solid), 
element number 2 [1.5, 3] (dashed), and element number 3 [3, 4] (dash-dot). (b) y = sin(x) 
(thick solid line) and a linear finite element approximation (thick dotted line) along with the 
contribution from individual shape functions for a nonuniform finite element mesh (thin 
solid lines). 


X1X2 (x2 — x1) + (x? — x3) x + (x2 — x1) x? 


z: 


N3(x) = -7 2,2 2 2 
=x x2 + X1X5 + XIX3 — X3X3 — X1X3 + X2X3 


(7.12c) 


This is a general expression where x2 can be located anywhere x; < x2 < x3, 
which can be simplified if x2 is located at the center of the element such that x2 = 
(x1 + x3) /2 so that the shape functions become 


(x — x3) (2x — x1 — x3) 


z 7.1 
Mia) a ee (7.13a) 
Veja UU 2 (7.13b) 
(x1 — x3) 
poa E ca im (1.130) 


(x1 — x3)? 


Again, adjacent elements share the connecting node such that t3 of the left ele- 
ment is #1; of the right element, which produces a continuous approximation across 
elements. These quadratic basis functions are shown in Fig. 7.30(a) for the same 
nonuniform mesh, and applied to approximate y = sin(x) in Fig. 7.30(b). Note the 
approximation more closely resembles y = sin(x) compared to the example with lin- 
ear elements and now includes some of the curvature. 


Higher-order elements 


A feature of basis functions of any order constructed in this manner is that they always 
have a value of one at their corresponding node and zero at all other nodes. These 
correspond to Lagrange interpolation polynomials of order n (see Section 9.2.1) from 
which we can write the ith shape function, J (x) = Nj (x), as 
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Comparison of shape functions of orders 1-6. 
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(7.14) 


These are plotted in Fig. 7.31 for element orders 1—6 for evenly spaced nodes. An 
nth-order Lagrange element in one dimension has n + 1 nodes corresponding to 
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3 (x3, y3) 


2 (x2, Y2) 


1 (x1, y1) 


(a) (b) 
FIGURE 7.32 


(a) A linear triangle element with three nodes. (b) Diagram of area coordinates for triangular 
element in part (a). 


n + 1 shape functions. The higher-order elements have more degrees of freedom 
per element and are thus able to more closely approximate an arbitrary function and 
achieve lower error per element at increased computational cost. The computational 
cost per degree of freedom is also higher since there are more overlapping shape func- 
tions leading to more complicated systems of linear equations. The optimal balance 
of element order and mesh density is problem-specific. Finite element implementa- 
tions which focus on controlling error by using high-order elements are known as 
“‘p-methods,” while those that instead focus on reducing the size of relatively low- 
order elements are known as “h-methods.” The combination of the two approaches is 
called “hp-methods.” 


Two dimensions 

Linear triangular elements 

For the two-dimensional case, we can begin by decomposing our domain into tri- 
angular elements with a node at each vertex, as shown in Fig. 7.32(a). With three 
nodes, we have three degrees of freedom to approximate u(x, y). The lowest-order 
polynomials with three degrees of freedom are 


(x, y) = ao tax + a2x?, (7.15a) 
u(x, y) = ao + ax + ary, (7.15b) 
ŭ(x, y) =a +a1y +My”. (7.15¢) 


However, it is usually desirable to maintain equal powers of both x and y so that the 
quality of the approximation is independent of the rotation of the element. Therefore, 
one can proceed with Eq. (7.15b). 
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Following the same procedure as for the one-dimensional case, one can write u in 
terms of the nodal values to obtain the coefficients ag to a2, 


ao lox y uy 
a|=|1 x2 y it2 
az 1 x3 y3 ü3 
1 | 293-4392 XBL —xX1y3 X1y2— %21 | | 1 
=5,| 027% —yt+y3 yi =}2 üz |, (7.16) 
—x2+ x3 X1 — X3 —x1 + x2 U3 


where A is the area of the element (and the determinant of the matrix) given by 
A= x1 (y2 — y3) + x2 (—y1 + y3) + x3 (yı — y2). By collecting the terms the shape 
functions are 


xX2y3 — x3 y2 + (y2 — y3) x + (—x2 + x3) y 


M(x, y) = a (7.17a) 
—x1y3 + x31 + (—y1 + y3) x + (xı — x3) 
No (x, y) = * F 4 a (7.17b) 
x1y2 — X21 + (y1 — y2) x + (—x1 + x2) 
N3 (x,y) = a 2 (1.170) 


Area coordinates for triangles 

The linear shape functions derived in Eq. (7.17) are also known as “area coordinates” 
or “triangle coordinates.” They are a useful coordinate system for working with tri- 
angles that do not have orthogonal edges and will be used to derive the higher-order 
triangular elements. In two dimensions, Cartesian coordinates have two indepen- 
dent components (e.g., P(x, y)), while area coordinates also have two independent 
components plus a third dependent component (e.g., P(L1, L2, L3)), as depicted in 
Fig. 7.32(b). A point within an arbitrary triangle can be uniquely defined by 


Lı = Nı (x,y), (7.18a) 
L2 = M2 (x,y), (7.18b) 
L3 = N3 (x, y) (7.18c) 


or by recognizing that the constraint Nj + N2 + N3 = 1 = Lı + L2 + L3 can pro- 
duce the third dependent coordinate from any two independent coordinates. The name 
“area coordinates” arises because their value relates to the area of the shaded subtri- 
angles in Fig. 7.32(b) to the area of the whole triangle 


z area (P23) l (7.19a) 
area (123) 

_ area ee) (7.19b) 
area (123) 

_ area(P12) (7.19¢) 


~ area (123) ` 
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1 (x1, y1) 
(1, 0, 0) 


FIGURE 7.33 


(a) A quadratic triangle element with six nodes with global Cartesian coordinates and local 
area coordinates. (b) A cubic triangle element with 10 nodes with local area coordinates. 


To return to Cartesian coordinates from area coordinates the relationship is 


x=L1,x, + Lox. + L3x3, (7.20a) 
y= Liyı + Loy2 + Lys. (7.20b) 


Quadratic and cubic triangular elements 


The second-order triangle, shown in Fig. 7.33(a), has six nodes (three vertex nodes 
and three edge nodes) such that u (x, y) = ao + aix + a2y + a3x° + a4xy + asy. 
The same process outlined throughout this chapter can be utilized to determine the 
coefficients ag to as and therefore the shape functions. However, since there are now 
six nodes, this method requires inverting a 6 x 6 matrix for each element to obtain the 
shape functions. The growing complexity of the shape functions, as well as the even- 
tual need to efficiently integrate functions over each element, motivates an alternate 
approach to derive high-order and high-dimensional elements. Instead, one can write 
the shape functions in terms of the area coordinates L1, L2, L3 defined by Eq. (7.17) 
and Eq. (7.18). One can write the shape functions as products, which is one at the 
corresponding node and zero at all others: 


Ni = (2L; — l) L}, (7.21a) 
No = (2L2 — 1) Lo, (7.21b) 
N3 = (2L3— 1) L3, (7.21c) 
N4 =4L1L2, (7.21d) 
Ns =4L2L3, (7.21e) 


Ns = 4L1 L3. (7.21f) 
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Similarly, the shape functions of the 10-node cubic triangle shown in Fig. 7.33(b) 
may be written as 


N Z L : L i L (7.22a) 
= , .22a 
1 2 1 3 1 3 1 
N: 2 L ! L z L (7.22b) 
= 2 2 3 2 3 2> . 
N: , L : L 2 L (7.22c) 
= -2LC 
3 5 3 3 3 3 3, 

27 1 
Ng=—L1L2{Li-=), (7.22d) 

2 3 
N: Ti L2(L 2 (7.22e) 
= — => 222 

5 2 142 1 3 , 

27 1 
No = —L2L3| La- =), (7.22f) 

2 3 

27 2 
N7 = —LzL3 | L2- - |), (7.22g) 

2 3 

27 2 
Ng = — Lı L3 | Li — 3) (7.22h) 
1 ; 
No = — Lı L3 (n = 5) ; (7.221) 


This process can be repeated to efficiently produce triangular Lagrange elements 
of arbitrary order. 


Linear quadrilateral elements 

The other common element shape for discretizing in two dimensions are quadri- 
laterals. The simplest quadrilateral with one node at each vertex, as depicted in 
Fig. 7.34(a), is given by ü (x, y) = ao +a1x +a2y+a3xy. Again, the general process 
outlined in the previous sections could be applied in which the coefficients ap to a3 
are determined in terms of the coordinates (xp to x3 and yo to y3) by solving a linear 
equation in order to obtain the shape functions. However, like the higher-order trian- 
gles, an alternate approach is to use local element coordinates (£, 7) to more easily 
account for potential nonparallel sides. First, one considers the simple quadrilateral 
in Fig. 7.34(b) with nodes at € = +1 and n = +1. For this simple element geometry, 
one can formulate the shape functions in terms of Lagrange interpolation polynomi- 
als (Eq. (7.14)) along the edges, which are easily defined in (£, 7) (see Fig. 7.35(a) 
and Fig. 7.35(b)), 


1 
Mie m=z- n), (7.23a) 


1 
MEM=70+h)0-, (7.23b) 
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FIGURE 7.34 


(a) A quadrilateral element with four nodes in global (x, y) Cartesian coordinates. (b) A 
quadrilateral element with four nodes in local (£, 7) coordinates. 


3 (x3, Y3) 


FIGURE 7.35 


(a) A quadrilateral element with lines of constant local (£, n) coordinates. (b) 
Three-dimensional visualization of the N3 shape function over the sample element. 
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1 
ME. n=7d+s +n), (7.23c) 


1 
MG =7 07H +n). (7.23d) 


The global coordinates (x, y) corresponding to local coordinates (£, 7) are a linear 
combination of the nodal global coordinates and local shape functions 


x (E, n) = Ni (E, n) x1 + N2 (&, n) x2 + N3 (E, n) x3 + N4 (E, n) X4, (7.24a) 
y (E, n) = M1 (E, n) yi + N2 (E, n) y2 + N3 (E, n) y3 + Na E, n) ya. (7.24b) 


This relationship allows arbitrary quadrilateral elements without parallel sides to be 
integrated or plotted parametrically using the easy to define domain in local (&, 7) 
coordinates. Spatial derivatives such as a requires the derivatives of the shape func- 
tions in terms of the global (x, y) coordinates which can be obtained by applying the 
chain rule, 


ƏN (En) _ ƏN (E, n) dx | INE, n) Oy 


= ; 7.25a 
0g Ox Ə = dy 0& € ) 
ON (E, ƏN (E, n) 0 ƏN (E, n) 0 
6,1) _ ƏN G, n) dx T E, n) dy l (1.250) 
an əx on dy on 
which may also be written in terms of the Jacobian matrix 
ƏN (E.n) əx ay aN(é.n) 
0g —| 0 0 a 
INEM | | ax ay ane | os) 
an an ðn dy 


Higher-order quadrilateral elements 


Using a local coordinate system in which the nodes are conveniently arranged, 
higher-order elements can be easily created from products of Lagrange interpolation 
polynomials in terms of £, 7. A nine-node element supports a polynomial of the form 
ü (x, y) = aọ + ax + a2y + a3xy + a4x? + asy? + agx? y + axy? + agx? y”. The 
nodes form an evenly spaced grid local at £ = —1, 0, 1 and n = —1, 0, 1. Examples 
of the shape functions corresponding to a corner, edge, and central node are shown in 
Fig. 7.36. 

Similarly, a 16-node element supports a polynomial of the form u (x, y) = ao + 
aix + ary + a3xy + a4x? + asy? + asx? y + a7xy? + agx? y? + aox? + ayoy? + 
aj1x?y + a12xy? + a13x? y? + ajx? y? + aysx°y?. The nodes form an evenly spaced 
grid local at £ = —1, +, L, l and ņn = —1, 4, i 1. Examples of the shape functions 
corresponding to a corner, edge, and central node are shown in Fig. 7.37. 
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FIGURE 7.36 


Shape functions for a nine-node quadrilateral corresponding to a node located: (a) at the 
corner (€ = 1, n = 1), (b) along the edge (¢ = 1, n = 0), and (c) in the middle ( = 0, n = 0). 
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FIGURE 7.37 


Shape functions for a 16-node quadrilateral corresponding to the nodes located: (a) at the 
corner ( = 1, n = 1), (b) along the edge (é = 1, n = 3), and (c) in the middle ( = 4, = 4). 


7.2.2 Solution of partial differential equations 

Weighted error 

In this section, one utilizes the finite element basis developed in Section 7.2.1 to find 
the approximate solution to a partial differential equation which minimizes the error. 
Once the mesh and shape functions are defined, the approximate solution to a partial 


differential equation can be written as the sum of the product of the shape functions 
times the magnitude at each of the nodes: 


ii(®) = Y N @u = [Ñ]: [Ää]. (7.27) 
i=l 


To demonstrate this process, consider the example of Poisson’s differential: 


V-u — f(x) =0. (7.28) 
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Since & is approximate rather than exact, it may not perfectly satisfy the differential 
and one expects to have a spatially varying residual error R(x) leading to 


Vi — f(x) = R(X). (7.29) 


This error can be converted to a global measure of error by introducing a currently 
undefined weighting function W(x) and integrating over the domain where the error 
is defined as 


Eror= | wrav = | WG) (vi - f@)av (1.30) 
V V 
af WOVE — WR) F(R) dV. (7.31) 
V 


For a smooth, well-behaved weighting function, it is advantageous to manipulate this 
error considering the divergence of a scalar vector product ab, 


y. (ab) Vab UND: (1.32) 


If a = W (È) and b = VN(x)i, the W(x) V2N(x)a term in the error may be written 
as 


Error = / (V-W@)VN(i)i -VW - VN) — WR) FR) dV. (1.33) 
V 


The divergence theorem in Eq. (4.16) may be applied to the first term on the right- 
hand side to convert it to a surface integral, 


Error = $ i-W)VN(X)adS — | VWE) VN) +WÆ@FfR)AV. 
S V 
(7.34) 


The surface integral term is associated with the boundary conditions, and is often but 
not always zero (i.e., no flux boundary conditions). This manipulation has effectively 
transferred one of the derivatives from the shape functions to the weighting functions, 
which can be advantageous for obtaining numerical solutions. 


Weak form 


If the error is zero and the weighting function is well behaved, this formulation is 
known as the “weak form”, because it is equivalent to the “strong form” shown in 
Eq. (7.31), but has weaker (relaxed) smoothness requirements on the solution (and 
therefore the shape functions). In the weak form, the weighting functions are known 
as test functions. The general process for obtaining the weak form starting with 
the strong form follows the same approach: (1) multiply the strong form by a test 
function, (2) integrate the equation over the whole domain, and (3) manipulate the 
equation to transfer some of the derivatives to the test function. 
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Galerkin method 

The choice of the weighting and shape functions produces different numerical meth- 
ods: least squares, collocation (i.e., the sum of fixed points), and the Galerkin 
method, among other techniques. For finite element methods the most common is 
the Galerkin method, which sets the weighting function equal to the shape functions 
W(x) = N; (X) and searches for an approximate solution with error equal to zero. For 
each node this approach leads to equations of the form 


(VN; (Xx) - VN(x)ii + Ni (x) f (%))dV =0, (7.35) 
V 


which collectively produce a system of equations for the unknown components of 
u. The components of u are the unknown nodal values &;, which are not spatially 
varying and can therefore be factored from the integrals, leading to 


ui f VNiVNidV +u f VNiVNodV +--+ un f VNV N dV 
uy | VNoVNidV +u f VN2VNodV +--+ + un f VN2VNndV 


uy f VNraVNidV +u |f VNaVNodV +--+ un f VNnVNndV 


[Ni fav 
f Nofdv 
=— f (7.36) 
JS NafaV 
Finally, factoring ú yields a matrix equation 
JVMVNdV [VN VNAV ... [VNIVNndV | Puy 
[VN2VNidV f[VNoVNodV_... f VN2VNndV | | v2 
[VNnaVNidV [VNaVNodV... fVNaVNndV_| L¥n 
{Mi fav 
| Nofdv 
=— : ; (7.37) 
[Nn fav 
or in matrix form 
[K]ū = F (7.38) 


lll Ee 


7.2 General theory of finite elements 227 


Here the matrix K is termed the “stiffness” matrix. The elements of the stiffness 
matrix have the form 


Ki,; = [ vnivnjav. (7.39) 


Typically most of the entries in this matrix evaluate to zero because the shape func- 
tions are only nonzero in the elements which include the node (i.e., nonoverlapping 
shape functions yield zero for this integral). This produces a “sparse” matrix where 
most entries are zero and only the nonzero values are stored, which reduces the com- 
putational costs compared to “dense” matrices. 

The calculation of the nonzero elements of the matrix K and the vector F requires 
evaluation of the integrals on each finite element. In most finite element implementa- 
tions, this calculation is usually performed numerically with a “quadrature” method 
that approximates an integral as a weighted sum of the integrand evaluated at specific 
points: 


n 
J g&©dV =Y wigii. (7.40) 
i=l 

Here w; are the quadrature weights applied to the integrand gx evaluated at x; and 
n is the number of points evaluated. The most widely used quadrature method is 
“Gaussian quadrature,” which is exact for polynomials 2n — 1 or less (see, for ex- 
ample, Section 9.5.3). Other quadrature techniques such as “Gauss—Kronrod” are 
sometimes utilized when estimates of the integration error are required. In practice it 
is often sufficient to integrate to a higher degree but skip the error test. 

The element formulations are normally precalculated for standard equations and 
element types. The stiffness matrix is assembled by adding the contributions from 
each element in the mesh. This “stiffness” matrix depends on the partial differential 
equation, the boundary conditions, and the mesh. 

Boundary conditions need to be applied that will constrain the system to obtain 
a nonsingular solution. If the system is nonlinear, an iterative approach is utilized 
in which the stiffness of each element is calculated using the previous solution. The 
stiffness is updated on each iteration along with the solution vector and the calculation 
proceeds until convergence is achieved. For example, a nonlinear system may involve 
the calculation of temperature, where the thermal conductivity itself is dependent on 
the temperature. 

Once the linear system of equations from Eq. (7.38) is assembled, standard so- 
lution techniques can be applied as discussed in Chapter 4. There are two main 
approaches used for finite elements: 


e Direct matrix factorization. Techniques such as lower-upper factorization are more 
stable/robust and simpler to use and configure. However, it is slower for large 
problems and requires more memory. 

e Iterative methods. These methods solve the system by updating an initial guess; 
the most common are Krylov methods such as GMRES and BiCSTAB. They often 
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require less memory and are faster than the direct factorization method for large 
(usually three-dimensional) problems. However, it is less stable for poorly/ill- 
conditioned problems. 


Additional considerations for finite element method analysis 


It is important to recognize that the finite element mesh does not have a physical 
meaning since the true solution does not depend on the mesh itself, and an infinite 
number of meshes can provide equivalent results. Changing the mesh a small amount 
should not substantially change the answer. If it does, the result is an artifact of the 
solution process and is not representative of the true solution. It is good practice to 
perform a mesh convergence study to ensure that refining the mesh does not strongly 
alter the solution. The mesh is simply a part of the finite element method, which must 
be small enough in order to “accurately” represent the solution. 

On the other hand, the boundary conditions are extremely important and are phys- 
ically relevant to the problem. They can be simple (e.g., a fixed displacement or 
constant temperature) or extremely complicated (e.g., with the contact of adjoining 
surfaces, friction between surfaces, or no slip flow). 

One must also ensure that the element order is sufficient to capture the desired 
properties of the solution. For example, consider a displacement field which satisfies 
a solid mechanics model. Components of the strain (€) and stress (ø) are related to 
the derivative of the displacement u such that 


_ ou 
~ Ax’ 
ðu 


Ox = Bey = E7 (7.42) 


Ex (7.41) 


where E is the Young’s modulus. For a linear triangular element, u(x, y) = ap + 
aix +a2y = Ni (x, y)ui + N2 (x, y)u2 + N3(x, y)u3 from Eq. (7.15b) and Eq. (7.17), 
and therefore the strain is 


Ou(x, ON (x, a N(x, Ə N3(x, 
u(x, y) CN 1x, y) P 2(x, y) ee 3(x, y) (7.43) 

Ox Ox Ox Ox 
_ (y2 — y3) 41 + (—y1 + y3) U2 + (yı — Yo) 3 (7.44) 


2A 


which is no longer a function of the spatial coordinates. Therefore, the strain and 
stress field within linear triangular shape functions cannot vary smoothly between 
elements. By contrast, the derivatives of higher-order triangular elements, or linear 
quadrilateral elements, are not constant and therefore better able to resolve the strain 
fields. In general there is no clear “best” element for all applications, i.e., this choice 
depends on the application. One needs to consider what features are expected/needed 
to be captured. Can rectangular elements be used? How significant are the gradients? 
Is there a region of interest? How many spatial derivatives are required? 

Similarly, to achieve convergence one requires an acceptable error tolerance that 
depends on the quantity being solved for, the required accuracy needed for the solu- 
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tion, and the accuracy of the underlying physics and known material properties. Other 
considerations include (i) the available computing resources (i.e., the random-access 
memory [RAM] or central processing unit [CPU]); (ii) the degree of nonlinearity, 
which has a strong effect on the solution convergence (i.e., large deformation and 
geometric stiffness); (iii) the type of problem being solved (e.g., stationary, time- 
dependent, eigenvalue, frequency response); and (iv) whether the problem involves 
single physics or multiphysics (with one- or two-directional coupling). A more de- 
tailed introduction to finite elements is provided in Hutton (2004) and Zienkiewicz et 
al. (2013). 


Problems 


7.1 Solve the time-independent Laplace heat conduction equation in Example 6.2.1 
with the COMSOL software package. 


7.2 Solve the Poisson equation in Example 6.2.2 with the COMSOL software pack- 
age. 


7.3 Solve the time-dependent heat conduction equation in Example 6.2.3 with the 
COMSOL software package. 


7.4 Solve the wave equation in Example 6.2.4 with the COMSOL software package. 


7.5 Solve the first-order ordinary differential equation in Example 6.1.4 with the 
COMSOL software package. 


7.6 Solve the second-order ordinary differential equation in Example 6.1.5 with the 
COMSOL software package. 


7.7 A time-dependent fission product diffusion model in the nuclear fuel matrix can 
be coupled with a mass balance in the fuel-to-sheath gap and coolant to predict the 
coolant activity behavior from a defective fuel element for variable power operation 
in a nuclear reactor.! The diffusion equation for the fission product concentration C 
in an idealized fuel grain sphere of radius a, accounting for radioactive decay A with 
a time-dependent diffusion coefficient D and fission product generation rate B (= 
fission rate F times the fission product yield Y), is 


F(Y 
) AC (r,t) + y` (7.45) 


~ or 


dC(r,t)  D(t) a 7 0C(r, t) 
= r 
ot r2 ðr 


1 BJ. Lewis, P.K. Chan, A. El-Jaby, F.C. Iglesias, and A. Fitchett, “Fission product release modeling for 
application of fuel-failure monitoring and detection — An overview,” J. Nucl. Mater. 489 (2017) 64-83. 
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Defining the dimensionless variable x = r/a and multiplying through by the volume 
V yields 


dux, t) D'(t) a ( du(x, t) 


Xr F(t)Y 7.4 
ot x? Ox Ox ) eT EN (140) 


where u = CV and D’ = D/a?. The initial and boundary conditions are given as 


u(x,0)=0, 0<x<1, t=0, (7.47a) 
ðu 
—=0, x=0, t>0, (7.47b) 
Ox 
u(l,t)=0=0, x=1, t>0. (7.47c) 


A zero concentration is assumed at t = 0. For the boundary conditions, symmetry is 
assumed at the center of the sphere (i.e., a zero flux) and a zero concentration at the 
fuel grain surface with fission product release from the grain surface. The diffusional 
release-to-birth rate ratio derived from the Fick’s law of diffusion is 


R 4ra? acVv 3D' a 
po D a eas (7.48) 
B FY (4ra? /3) or | a FY ðx |= 
Equivalently, the release rate R (atoms s~!) from the fuel matrix is 
ð 
R39 | . (7.49) 
OX |1 


Thus, the time-dependent diffusion equation in Eq. (7.46) can be solved by numerical 
methods subject to the conditions in Eq. (7.47a) to Eq. (7.47c). The derivative of this 
solution (at x = 1) is subsequently used in Eq. (7.49), which is the source release rate 
from the fuel matrix into the fuel-to-sheath gap. As such, the mass balance for the 
number of atoms Nọ in the gap is 


dN, (t) 
dt 


= R(t) — Atv) Ned), (7.50a) 
with initial condition 
N(t)=0, t=0. (7.50b) 


A first-order loss-rate process is assumed from the gap characterized by an escape rate 
coefficient v (s7!) which can also be considered as a function of time with element 
deterioration. The mass balance in the coolant is similarly given by 


dN¢(t) 
dt 


= Vv(t)Ne(t)-— A+ PN: (t) (7.5 1a) 
with initial condition 


N-(t)=0, t=0. (7.51b) 
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Here a time-dependent coolant purification rate constant £ (t) for the coolant is con- 
sidered. For this problem, the empirical diffusion coefficient D’ (s7!) as a function 
of the element linear power P (kW m7!) is derived from historical coolant activity 
data in the commercial stations as 


(7.52) 


19 0-114-P) 
t —12 
D = 8.6 x 10 a F 


In addition, assuming 180 MeV fission! and a fuel stack length of 0.48 m, the fission 
rate F (fissions s7!) = 1.489 x 10!3 P (kW m7!). 


(a) The solution of the coupled Eq. (7.46) to Eq. (7.52) provides a prediction 
of both the number of atoms in the gap and coolant as a function of time for a 
variable fuel element linear rating and coolant purification history. Solve the time- 
dependent governing transport equations using the commercial COMSOL software 
package (version 5.4) employing a finite element technique following the method- 
ology in Section 7.1. For this calculation, evaluate the I-131 activity concentration 
Ac(t) (uCi kg!) = AN-(t)/G.7 x 1074 Bq uCi™!)/244,000 kg in the reactor 
coolant over an irradiation period of 30 days at a constant linear power rating of 
P = 40 kW m” !. For I-131, the following data apply: Y = 0.0288 atoms fission”! 
and A = 9.98 x 1077 s~!. One can take v = 3 x 1076 s7! as a typical escape rate co- 


efficient for iodine with a reactor coolant purification constant of B = 5 x 1075 s~!. 


7.8 Find the weak form of V - Vu + 2u + exp(x) = 0. 


7.9 Starting with the weak form from Problem 7.8, use the Galerkin method to 
find the contribution to the element load vector from each shape function for a one- 
dimensional linear Lagrange finite element on the range x, to x2. 


7.10 Evaluate the contributions to the load vector from Problem 7.9 for a mesh 
consisting of one-dimensional linear Lagrange finite elements with four elements 
with nodes positioned at x = 0, 1, 1.5, 1.75, and 2. 


7.11 Assemble the global load vector from Problem 7.10. 


7.12 Starting from the weak form from Problem 7.8, use the Galerkin method to 
find the contribution to the element stiffness matrix from each shape function for a 
one-dimensional linear Lagrange finite element on the range x, to x2. 


7.13 Evaluate the contributions to the stiffness matrix from Problem 7.12 for the 
mesh described in Problem 7.10. 


7.14 Assemble the global stiffness matrix from Problem 7.13. 

7.15 Solve the linear system Ku = F for the unknown nodal values of u using K 
from Problem 7.14 and F from Problem 7.11. 

7.16 Graph the approximate solution from Problem 7.15 for u(x) over the range 
x = 0 to 2. Also graph oe over the range x = 0 to 2. 
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7.17 Repeat the steps in Problems 7.8 to 7.16 for the same mesh using second- 
order Lagrange elements with the internal node placed at the center of each element. 
Compare with the approximate solution obtained using linear finite elements. 


7.18 Solve the differential equation in Problem 7.8 with the COMSOL software 
package with a fine mesh and compare to the results obtained in Problems 7.16 and 
RII: 


CHAPTER 


Treatment of 
experimental results 


Every physical measurement is subject to a degree of uncertainty. “An engineer or 
scientist can ill afford to waste time in the indiscriminate pursuit of the ultimate in 
accuracy when such is not needed.” 


8.1 Definitions 


A measurement is normally taken several times. The “best” value of a measurement 
set may be determined from either the mean (average, x) or the median. The mean 
is obtained by dividing the sum of a set of replicate measurements by the number of 
individual results in that set. The median of a set is the result about which all others 
are equally distributed (for an odd number of measurements, it is the middle number 
of the set when all values are arranged in order of increasing magnitude; for an even 
number of measurements, it is the average of the central pair). 


Example 8.1.1. Calculate the mean and median for the measurement set 10.06, 
10.20, 10.08, 10.10. 


Solution: 
The mean = x = (10.06 + 10.20 + 10.08 + 10.10)/4 = 10.11, and the median = 
(10.08 + 10.10)/2 = 10.09 (since the set has an even number of measurements). 


The precision is used to describe the reproducibility of results and resolution of 
a measurement when measured again with the same instrument. The accuracy of a 
measurement denotes its nearness to a true or accepted value. A measurement that is 
highly accurate has an associated small error. In a number of circumstances, the true 
value for an experiment is not necessarily known so that only an estimated accuracy 
can be realized. Although very precise equipment can consistently measure small 
differences in a physical measurement, it may still be inaccurate as a high-precision 
measurement is not necessarily a sufficient condition for high accuracy. 

The deviation from the mean in absolute terms, |x; — x|, is a common method 
for describing precision. The spread or range (W = Xmax — Xmin) in a set of data 
is the numerical difference between the highest and lowest results. Sometimes, it is 
worthwhile to indicate the precision relative to the mean in terms of a percentage 
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Xi = X . . . 
(= x 100. The most important measures of precision are the standard devi- 
x 


ation and the variance (see below). The accuracy of a measurement is often described 
in terms of an absolute error, E = x; — x+, defined as the difference between the ob- 
served value x; and the true value x;. The relative error can also be expressed as a 


x 100%, where, in this case, the sign of the error is retained 
Xt 


to indicate whether the result is high or low. 

Experimental uncertainties that can be revealed by repeating measurements are 
called random errors. Errors that cannot be revealed in this way are called systematic 
errors. Thus, uncertainties that arise in a given measurement can be classified into two 
categories: (i) determinate errors (systematic errors) and (ii) indeterminate errors 
(random errors). Determinate errors arise from: (i) an instrument bias due to a cali- 
bration error or uncompensated drift for example (most of these are unidirectional); 
(ii) method errors (e.g., analysis based on nonideal chemical or physical behavior); 
and (iii) personal errors (for example, judgment bias in reading a scale, gross arith- 
metical mistakes). Instrumental errors are usually found and corrected by calibration. 
Indeterminate errors arise from an inability to discriminate between readings dif- 
fering by less than a small amount. This type of error cannot be controlled by an 
experimenter and eliminated, producing a random scatter of results for replicate mea- 
surements. As an example, Table 8.1 shows all possible ways that four uncertainties 


Xj 
percentage, ( 


Table 8.1 Possible ways four equal-sized uncertainties can combine. 
Combination of uncertainties Indeterminate error Frequency 
U U + U3 + U4 +4U 1 
U U2 + U3 + U4 
U U + U3 + U4 +2U 4 
U U2 — U3 + U4 
U U2 + U3 — U4 
U U2 + U3 + U4 
U Uz — U3 — U4 
U U2 + U3 — U4 0 6 
U Uz — U3 + U4 
U U2 + U3 — U4 
U Uz — U3 + U4 
+U, — U2 — U3 — U4 
—U, + U2 — U3 — U4 —2U 4 
—U, — U2 + U3 — U4 
—U, — U2 — U3 + U4 
—U, — Uz — U3 — U4 —4U 1 
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FIGURE 8.1 


Theoretical distribution of indeterminate error arising from (a) four, (b) eight, and (c) a large 
number of uncertainties. Curve (c) shows the theoretical Gaussian distribution and (d) is an 
experimental distribution curve (adapted from Skoog and West, 1982, p. 50). 


(of equal magnitude) can combine to provide an indeterminate error. The theoret- 
ical frequency distribution of errors is shown in Fig. 8.1(a). Similarly, Fig. 8.1(b) 
shows the theoretical distribution for 8 equal-sized uncertainties. Extending this to a 
very large number of uncertainties of smaller and smaller size, the continuous “bell- 
shaped” distribution curve of Fig. 8.1(c) results, called a Gaussian or normal error 
curve. Indeterminate errors in chemical analysis (e.g., deviations of the mean of a 
large number of repetitive pH measurements on a single sample) most commonly 
have a discontinuous distribution that approaches a Gaussian curve (see Fig. 8.1(d)). 
The Gaussian distribution represents the theoretical distribution of experimental re- 
sults to be expected as the number of analyses involved approaches infinity. The 
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Gaussian distribution of most analytical data permits the use of statistical techniques 
to estimate the limits of indeterminate error from the precision of such data. 


8.1.1 Standard normal distribution 


The Gaussian distribution or normal distribution function is a continuous probability 
distribution for a real-valued random variable. This distribution function is symmetric 
about the mean, which shows that data near the mean are more frequent in occurrence 
than data away from it. Since it is a probability, the area under the curve is unity. It 
can be described mathematically as (see Fig. 8.1(a)) 


f(x) = (8.1) 
Oo 


where mathematically u is the center of the distribution and ø is the width of the 
distribution. Specifically, the random variable x represents values of individual mea- 
surements, which are centered about the arithmetic mean p for an infinite number 
of such measurements. The parameter f is the frequency of occurrence for each 
value of the deviation from the mean (x — u), and o, which is called the standard 
deviation, characterizes the width of the curve. A property of f includes its nor- 


[0,6] 
malization, f f(x)dx = 1. Also, by definition from calculus theory, the expected 
=00 


T xf (x) dx T (x —x)* f(x) dx 
values x = ——22—____ = p and s? = —* =o”, where u and 
f(x) dx I f(x) dx 


=09 —=00 
o are the theoretical mean and standard deviation, respectively, obtained after many 
measurements. A new random variable, 


z= , (8.2) 


can be introduced to transform the entire family of normal curves into just one curve 
called “the standard normal distribution” for all values of o (see Fig. 8.2). Thus, the 
unit for z in Fig. 8.2 is to. 

The normal error curve permits an estimate of the probable magnitude of the 
indeterminate error in a given measurement provided that the standard deviation is 
known. For instance, the probability that a measurement will fall within ¢ standard 

+to t 
deviations is P (with to) = f Í f(x)dx = — e77 /2 dz = normal error 
—to T J—t 
integral. For the second integral, the transformation in Eq. (8.2) has been used. In 
particular, P (with to) = 68.3%, that is, the probability that a measurement will fall 
within one standard deviation of the true answer is 68.3%. Thus, regardless of the 
standard deviation, the normal curve in Eq. (8.1) and Fig. 8.2 has the property that 


68.3% of the area beneath the curve lies within one standard deviation (+10) of the 
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FIGURE 8.2 


Normal error curve and standard normal error curve obtained with the transformation in 
Eq. (8.2). 


mean, n, (in other words, 68.3% of the values lie within these boundaries). Likewise, 
95.5% of all values lie within +20, and 99.7% of the values lie within +30. Thus, if 
o is available, the chances are 68.3 out of 100 that the indeterminate error associated 
with any given measurement is smaller than +1o, and 95.5 out of 100 that the error 
is less than 20, and so forth. 


For a very large set of data consisting of N measurements, the standard deviation, 
o, is given in terms of the squares of the individual deviations from the mean: 


N 
>i — 4)? 
o= DE (8.3) 


The variance is equal to a, Eqs. (8.1) and (8.3) are not valid for a small number 
of replicate measurements (2 to 20) because the mean of an infinitely large number of 
measurements, u (that is, the true value of a population) is never known and, instead, 
one is forced to approximate u by the average of a small number of measurements in 
the sample, x. With a small data set, not only is x likely to differ from jz, but equally 
important the estimate of the standard deviation. This negative bias for the small data 
set can be eliminated by substituting the number of degrees of freedom (N — 1) for N 
in Eq. (8.3), so that the approximation, s, of the true value of the standard deviation 
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is given by 


N 
Da- 
s= aa (8.4) 


Since jz is unknown, the mean must be calculated from the set of replicate data which 
removes one degree of freedom in Eq. (8.4). For 20 or more data, the difference 
between Eq. (8.3) and Eq. (8.4) is generally negligible. 


Example 8.1.2. Calculate the standard deviation for the data set in Example 8.1.1. 


Solution: With x = 10.11 from the calculation in Example 8.1.1, the standard devia- 
tion from Eq. (8.4) (that is, since N = 4 < 20) is evaluated as 


a 
Dao 
i=1 


4—1 


sz 


_ (10.06 — 10.11)?+ (10.08 — 10.11)2+ (10.10 — 10.11)2+ (10.20 — 10.11)2 
7 3 
= 0.06 = +0.06. 


Note that the data are not rounded until the end of the calculation (if the data were 
rounded too early, many of the differences from x would be zero). 


For analyses that are time consuming, data from a series of samples can often be 
pooled to provide an estimate of s which is superior to the value of any given subset. 
In this calculation, the number of degrees of freedom for the pooled s is equal to the 
total number of measurements minus the number of subsets (see Example 8.1.3). 


Example 8.1.3. The mercury concentration in samples of seven fish was determined 
by the method of absorption of radiation by elemental mercury (see Table 8.2). Cal- 
culate a standard deviation for this method, based upon the pooled precision data. 


Solution: 
The values in columns 4 and 5 of Table 8.2 were calculated as follows: 


xi | (i — X11 | Oi — 41)? 
1.80 0.127 0.0161 
1.58 0.093 0.0086 
1.64 0.033 0.0011 


3|5.02 SS = 0.0258 
¥, = 1.673 = 1.67 
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Table 8.2 Pooled sample data analysis. 


Sample No. of Results Mean SS of deviation 
No. meas. Hg content (ppm) ppm from mean 
1 3 1.80, 1.58, 1.64 1.67 0.0258 
2 4 0.96, 0.98, 1.02, 1.10 1.02 0.0116 
3 2 3.13, 3.35 3.24 0.0242 
4 6 2.06, 1.98, 2.12, 2.16, 1.89, 1.95 | 2.02 0.0611 
5 4 0.57, 0.58, 0.64, 0.49 0.57 0.0114 
6 5 2.35, 2.44, 2.70, 2.48, 2.44 2.48 0.0685 
7 4 1.11, 1.15, 1.22, 1.04 1.13 0.0170 
Total = 28 Total sum of squares (SS) = 0.2196 


The other data in column 5 of Table 8.2 were obtained similarly. Then s = 
0.2196 


28 —7 
estimate of s is a good approximation of o. 


= 0.10 ppm Hg. Because the number of degrees is greater than 20, this 
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Some applications of statistics which are used by experimentalists concerning the 
effects of indeterminate error include: 


(i) the confidence interval around the mean within which the true mean can be 
expected to be found (with a certain probability), 

(ii) the number of times a measurement should be repeated in order for the experi- 
mental mean to be included (with a certain probability) within a predetermined 
interval around the true mean, 

(iii) guidance concerning whether or not an outlying value in a set of replicated re- 
sults should be retained or rejected, 

(iv) estimation of the probability that two samples analyzed by the same method are 
significantly different in composition, 

(v) estimation of the probability that a difference in precision exists between two 
methods of analysis. 


8.2.1 Confidence interval 


The area under the normal error curve encompassed by z = +1.960 corresponds to 
95% of the total area. In this case, the confidence level is 95%, so that 95 times out 
of 100 the calculated value of (x — u) for a large number of measurements will be 
equal to or less than + 1.960 (see Table 8.3). Thus, using Eq. (8.2) and solving for u: 


confidence limit for u = x + zo. (8.5) 
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Table 8.3 Confidence limits for various values of z. 


Confidence level Z Confidence level z 

50 0.67 96 2.00 
68 1.00 99 2.58 
80 1.29 99.7 3.00 
90 1.64 99.9 3.29 
95 1.96 


Example 8.2.1. Calculate the 50% and 95% confidence limits for the first entry 
(1.80 ppm Hg) in Example 8.1.3. 


Solution: 

In Example 8.1.3, s = 0.10 ppm Hg and there were sufficient data to assume s > o. 
From Table 8.3, z = +0.67 and +1.96 for the confidence levels of 50 and 95%, 
respectively. Thus, from Eq. (8.5): 


e the 50% confidence limit for u = 1.80 + 0.67 x 0.10 = 1.80 + 0.07 ppm Hg; 
e the 95% confidence limit for u = 1.80 + 1.96 x 0.10 = 1.80 + 0.20 ppm Hg. 


For example, the second result reveals that the chances are 95 in 100 that jw, the true 
mean (in the absence of determinate error) will be in the interval between 1.60 and 
2.00 ppm Hg. 


Eq. (8.5) applies to the result of a single measurement. As previously discussed, 
o characterizes the uncertainty of the separate measurements x1, ...xy. However, the 
answer Xpes; = X, where 
pple (8.6) 
N 
represents a judicious combination of all measurements, which is more reliable than 
any one of the measurements considered separately. In fact, if one were to make 
many determinations of the average of N measurements, these would be normally 
distributed about the true value jz. The width of the distribution for x is called the 
standard error or standard deviation of the mean, oz. This quantity follows from the 
general formula in (see Section 8.4) for the propagation of random errors: 


a 2 = 2 
ox Ox 

o= (Zon) tat (Eos (8.7) 
Ox] OxN 


Since all x1, ..., xy are measurements of the same quantity, their widths are all equal, 
that is, oy, =... = Oxy =o. Also from Eq. (8.6), the partial derivatives are the same: 
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Ox Ox 1 . ; 
— =...= — = —. Therefore, using these results, Eq. (8.7) yields 
Ox] oxy N 


n= j(i ++(4e) -E-Z ae 


Thus, for the average of N replicate measurements, the more general form of Eq. (8.5) 
is 


ZO 
confidence limit for y = x + —. (8.9) 
VN 
Example 8.2.2. Calculate the 50% and 95% confidence limits for the mean value 
(1.67 ppm Hg) for sample 1 in Example 8.1.3. 


Solution: 

Again, s > o = 0.10 ppm Hg. For the three measurements using Eq. (8.9): 
a 0.67 x 0.10 

e the 50% confidence limit for u = 1.67 + a = 1.67 + 0.04 ppm Hg; 
te 1.96 x 0.10 

e the 95% confidence limit for u = 1.67 + : oe = 1.67 +0.11 ppm Hg. 


Example 8.2.3. Based on the previous calibration of a 10-mL pipette in Table 8.4, 
calculate the number of replicate measurements needed to decrease the 95% confi- 
dence interval for the calibration of a 10-mL pipette to 0.005 mL, assuming a similar 
procedure has been followed. 


Table 8.4 Replicate measurements from the calibration of a 10-mL pipette. 

Trial Volume of Trial Volume of Trial Volume of 
water delivered water delivered water delivered 
(mL) (mL) (mL) 

1 9.990 9 9.988 17 9.977 

2 9.993° 10 9.976 18 9.982 

3 9.973 11 9.981 19 9.974 

4 9.980 12 9.974 20 9.985 

5 9.982 13 9.970? 21 9.987 

6 9.988 14 9.989 22 9.982 

7 9.985 15 9.981 23 9.979 

8 9.970? 16 9.985 24 9.988 


Mean volume = 9.9816 = 9.982 mL; median volume = 9.982 mL. 

Average deviation from mean = 0.0052 mL = 0.005 mL. 

Spread = 9.993 - 9.970 = 0.23 mL; standard deviation = 0.0065 mL = 0.006 mL. 
a Maximum value. 

> Minimum value. 


a 
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Solution: 

The standard deviation for the measurement is 0.0065 mL (Table 8.5). Because s is 
based on 24 values, one may assume s —> ø = 0.0065. Using Eq. (8.9), the confidence 
interval is given by 


confidence interval = +——. (8.10) 


0.0065 


Therefore, 0.005 mL = 1.96 x I? which yields N = 6.5. Thus, by employing 


the mean of seven measurements, there would be a somewhat better than 95% chance 
of knowing the true mean volume delivered by the pipette to 0.005 mL. 


Table 8.5 Standard deviation calculations. 


Entry Trial number Mean Standard deviation Standard deviation 
(Table 8.4) (mL) (mL) (o) (Eq. (8.3)) (mL) (s) (Eq. (8.4)) 
1 1-24 9.982 0.0063 0.0065 
2 1-12 9.982 0.0070 0.0074 
13-24 9.982 0.0056 0.0058 
mean 9.982 0.0063 0.0066 
spread 0.000 0.0014 0.0016 
3 1-3 9.9854 0.0088? 0.0108? 
4-6 9.983 0.0034 0.0042 
7-9 9.981 0.0079 0.0096 
10-12 9.977° 0.0029° 0.0036° 
13-15 9.980 0.0078 0.0095 
16-18 9.981 0.0033 0.0041 
19-21 9.982 0.0057 0.0070 
22-24 9.983 0.0037 0.0046 
mean 9.982 0.0054 0.0067 
spread 0.008 0.0059 0.0072 


a Maximum value. 
> Minimum value. 


Limitations in the amount of time or the number of available samples may pre- 
clude an accurate estimate of o. Thus, it may be necessary that a single set of replicate 
measurements must provide not only a mean value but also a precision estimate of s. 
In this case, the t parameter is used in place of z, where (compare with Eq. (8.2)) 


(8.11) 
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In general, the ¢ distribution (or Student’s t distribution) is flatter than the normal 
probability distribution, causing more area to be contained in the tails of the distribu- 
tion (see Fig. 8.3). 


f(t) 
=ë 

= - -N=2 
----- N=S 


— N=% (normal) 


FIGURE 8.3 


Comparison of the Student ¢ distribution with the normal distribution. 


The parameter t is dependent not only on the desired confidence level but also on 
the number of degrees of freedom. As the number of degrees of freedom becomes 
infinite (more specifically, > 30), the values of t in Table 8.6 approach the values of z 
in Table 8.3. The ¢ distribution yields a larger confidence interval in order to account 
for the greater variability in s. The “central limit theorem” guarantees the sample 
means distribution will always be normally distributed if N > 30 regardless of the 
population distribution; however, care must be taken in applying the ¢ distribution 
to small sample sizes, where it must be ensured that the measured data are normally 


distributed. Thus, the confidence limit analogous to Eq. (8.9) is given by 


t 
confidence limit for u = z+ ——. (8.12) 


VN 


Example 8.2.4. A chemist obtained the following data for the alcohol content in a 
sample of blood: ethanol concentration = 0.084%, 0.089%, and 0.079%. Calculate 
the 95% confidence limit for the mean assuming (a) no additional data about the 
precision of the method and (b) that on the basis of previous experience s > o = 
0.006% ethanol. 


Solution: 
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Table 8.6 Values of t for various levels of probability. 


Degrees of Factor for confidence interval (%) 
freedom 80 90 95 99 99.9 
1 3.08 6.31 12.7 63.7 637 
2 1.89 2.92 4.30 9.92 31.6 
3 1.64 2.35 3.18 5.84 12.9 
4 1.53 2.13 2.78 4.60 8.60 
5 1.48 2.02 2.57 4.03 6.86 
6 1.44 1.94 2.45 3.71 5.96 
7 1.42 1.90 2.36 3.50 5.40 
8 1.40 1.86 2.31 3.36 5.04 
9 1.38 1.83 2.26 3.25 4.78 
10 1.37 1.81 2.23 3.17 4.59 
11 1.36 1.80 2.20 3.11 A 
12 1.36 1.78 2.18 3.06 4.32 
13 1.35 1.77 2.16 3.01 4.22 
14 1.34 1.76 2.14 2.98 A 
00 1.29 1.64 1.96 2.58 3.29 
(a) We have x = ea — maul = 0.084 and 


= 0.005. 


oe i (0.000)? + (0.005)? + (0.005)? 
a 3-1 


Table 8.6 indicates that tf = +4.30 for two degrees of freedom at a 95% confi- 
dence. Hence, the 95% confidence limit is 0.084 + 4.3 x 0.005//3 = 0.084 + 
0.012. The confidence limit at 95% can also be easily calculated with the Excel 
software package (Microsoft Office 365, build version 2018). Here one enters 
the three data points into a single column A (that is, Al:A3). By selecting 
“Formulas” in the top menu of Excel — “More Functions” — “Statistical” 
— “STDEV.S” and highlighting the three numbers in the Number 1 text box 
(A1:A3), a standard deviation value of 0.005 is similarly calculated. Subse- 
quently, choosing the function “CONFIDENCE.T” and entering in the three 
pop-up text boxes (i) a value of Alpha = 1 — 0.95 = 0.05 (for a 95% confidence 
limit), (ii) the previous Standard_dev = 0.005 and a Size = 3, a confidence 
interval for a population mean using a Student’s ¢ distribution of 0.01242 is 
evaluated, identical to the previous hand calculation. Finally, using the func- 
tion “AVERAGE,” a value of 0.084 is calculated for the three numbers. This 
analysis thereby yields the same previous result for the 95% confidence limit of 
0.084 + 0.012%. 


es = 
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(b) Because a good value of o is available, the 95% confidence limit = 0.084 + 


1. j 
T = 0.084 + STA = 0.084 + 0.007%. Note that a more accurate 


knowledge of o decreased the confidence interval by almost half. 


8.2.2 Rejection of data 


When an outlying result occurs in a data set, a decision must be made whether to 
reject it. In this case, the Q test can be used (although a cautious approach to the 
rejection of data should be adopted). Here the difference between the questionable 
result and its next nearest neighbor is divided by the spread of the entire data set; the 
resulting ratio is then compared to a critical value for a particular degree of confidence 
(see Table 8.7) (Dixon, 1951). 


Table 8.7 Critical values for rejection values for Q. 


Number of Ocrit (reject if Qexp > Qerit) 

observations | 90% Confidence | 96% Confidence | 99% Confidence 
3 0.94 0.98 0.99 

4 0.76 0.85 0.93 

5 0.64 0.73 0.82 

6 0.56 0.64 0.74 

T 0.51 0.59 0.68 

8 0.47 0.54 0.63 

9 0.44 0.51 0.60 

10 0.41 0.48 0.57 


Example 8.2.5. The analysis of a calcite sample yielded CaO contents of 55.95%, 
56.00%, 56.04%, 56.08%, and 56.23%. Should the last value that appears anomalous 
be retained? 


Solution: 

The difference between 56.23 and 56.08 is 0.15. The spread is (56.23 — 55.95) = 
0.28. Thus, Qexp = 0.15/0.28 = 0.54. For five measurements, Qcrit is 0.64 (at 90% 
confidence). Because 0.54 < 0.64, the last value should be retained. 


8.2.3 Hypothesis testing for experimental design 


Hypothesis testing can be used in scientific and engineering analysis processes as a 
basis for experimental testing. In this approach, a null hypothesis is employed, which 
assumes that the numerical quantities being compared are the same. The probabili- 
ties of the observed differences appearing as a result of indeterminate error are then 
computed from statistical theory. Usually, if the observed difference is as large as or 
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larger than a difference, which would occur at a specific probability level (for exam- 
ple, 5 times out of 100 for a 5% probability level), the null hypothesis is rejected and 
the difference is judged to be significant. 


Comparison of an experimental mean with a true value 

A judgment may have to be made whether the difference between an experimen- 
tal mean and a true value yz is the consequence of indeterminate (random) error in 
the measurement or of the presence of determinate (systematic) error in the method. 
The statistical treatment for this type of problem involves comparing the difference 
(x — u). If this value is larger than the critical value expected for indeterminate error, 
it may be assumed that the difference is real (that is, rejection of the null hypothesis) 
and that a determinate error exists. The critical value for rejection of the null hypothe- 
sis can be obtained from Eq. (8.12) (or equivalently from Eq. (8.9) if a good estimate 


of ø is available) such that 
z - (8.13) 
X — u= Ł—, s 
VN 


where N is the number of replicate measurements employed in the test. 


Example 8.2.6. A new procedure for the rapid analysis of sulfur in kerosenes was 
tested by the analysis of a sample which was known from its method of preparation 
to contain 0.123% S. The results were as follows: sulfur content = 0.112%, 0.118%, 
0.115%, and 0.119%. Do the data indicate the presence of a negative determinate 
error in the new method? 


Solution: 
.112 +0.11 11 ili 
3-0 ed tle ?) _ 0.116 and 
4 
0.004)? + (0.002)? + (0.001)? + (0.003)? 
ay 4 ( Fr 2+ (0.003)? O 0033. 


Also x — u = 0.116 — 0.123 = —0.007. Table 8.6 indicates that t = 3.18 for three 


t 3.18 x 0.0033 
degrees of freedom and 95% confidence. Hence, = = = = +0.0052, 


as compared to x — u = —0.007. This analysis indicates that five times out of 100, 
an experimental mean can be expected to deviate by +0.0052 or more. Thus, if 
one concludes that —0.007 is a significant difference and that a determinate error 
is present, one will, on average, be right 95 times out of 100 judgments (or equiva- 


lently, wrong five times out of 100). On the other hand, using a 99% confidence level 
t 
(with t = 5.84), a +0.0096. Thus, if one insists on being wrong no more than 


one time out of 100, one would have to say that no difference has been demonstrated 
between the original and new method (note this statement is different from saying 
that no determinate error exists). 
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Comparison of two experimental means 


A chemical engineer frequently employs analytical data to establish whether two 
materials are identical. If Nı replicate analyses are made on material 1, yielding an 
experimental mean x1, and N2 analyses on material 2, yielding a mean x2, Eq. (8.13) 
gives 


ae and EE te ge. T, 
Hi 5X1 E and U? = X2 E X1 — X2 = TIS, — Z. . 
WANE JNa Nı N2 


Equivalently, in accordance with Eq. (8.9), t and s can be replaced by z and o 
if a good estimate of ø is available. The last relation follows if one considers the 
null hypothesis that u; and u2 are identical. The number of degrees of freedom for 
finding ¢ in Table 8.6 is defined as N; + N2 — 2. If the experimental difference xı — x2 
is smaller than the computed value at a given confidence level, the null hypothesis is 
accepted and no significant difference between the means has been demonstrated. 


Example 8.2.7. The composition of a flake of paint found on the clothes of a hit-and- 
run victim was compared with paint from a car suspected of causing the accident. Do 
the following data for the spectroscopic analysis of titanium in the paint suggest a 
difference in composition between the two materials? From previous experience, the 
standard deviation for the analysis is known to be 0.35% Ti (that is, s > o = 0.35% 
Ti). The following data were obtained. In paint from the clothes, titanium content 
= 4.0% and 4.6%; and in paint from the car, titanium content = 4.5%, 5.3%, 5.5%, 
5.0%, and 4.9%. 


ee 4.0 +4.6 454+5.34554+5.0+49 
We have Hj = OT4O L43 and =“ Slee TOUT D _ 50, 


5 
Hence, xı — x2 = 4.3 — 5.0 = —0.7% Ti. Using z and o in Eq. (8.14), with z val- 
ues from Table 8.3 for 95% and 99% confidence levels, yields 


J2+5 

= N, + Np 7 +1.96 x 0.35 a = +0.57 for 95% confidence, 

E Ni N2 2+5 

+2.58 x 0.35,/ —— = +0.76 for 99% confidence. 
2x5 

Thus, 5 out of 100 data should differ by 0.57% or greater and only one out of 100 

should differ by as much as 0.76% Ti. Thus it seems reasonably probable (between 

95% and 99% certain) that the observed difference of —0.7% does not arise from 

indeterminate error but in fact is caused by a real difference between the two paint 

samples. Hence, the suspected vehicle was probably not involved in the accident. 


Comparison of precision measurements 

An F test may be used to determine whether (i) Method 1 is more precise than 
Method 2 or (ii) there is a difference in the precision of two methods. The null hy- 
pothesis for the F test assumes that the precisions are identical. In this test, the ratio 
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of the variances of the two measurements is compared with a critical value of F given 
in Table 8.8; if the experimental value exceeds the critical value, the null hypothesis 
is rejected. For procedure (i), the variance of the supposedly more precise procedure 
is always placed in the denominator; for procedure (ii), the larger variance always 
appears in the numerator. 


Table 8.8 Critical F values at the 5% level. 

Degrees of Degrees of freedom (numerator) 

freedom 

(denominator) | 2 3 4 5 6 12 20 es 

2 19.00 | 19.16 | 19.25 | 19.30 | 19.33 | 19.41 | 19.45 | 19.50 
3 9.55 9.28 9.12 9.01 8.94 8.74 8.66 8.53 
4 6.94 6.59 6.39 6.26 6.16 5.91 5.80 5.63 
5 5.79 5.41 5.19 5.05 .95 .68 4.56 4.36 
6 5.14 4.76 1.53 1.39 .28 .00 3.87 3.67 
12 3.89 3.49 3.26 3.11 3.00 2.69 2.54 2.30 
20 3.49 3.10 2.87 2.71 2.60 2.28 2.12 1.84 
oo 3.00 2.60 2.37 2.21 2.10 1.75 1.57 1.00 


Example 8.2.8. A standard method for the determination of carbon monoxide lev- 
els in gaseous mixtures is known from many hundreds of measurements to have a 
standard deviation of 0.21 ppm CO. A modification of the method has yielded an s of 
0.15 ppm CO for a pooled set of data with 12 degrees of freedom. A second modifica- 
tion, also based on 12 degrees of freedom, has a standard deviation of 0.12 ppm CO. 
(a) Is either of the modifications significantly more precise than the original? (b) Is 
the precision of the second method significantly better than the first? 


Solution: 


(a) Because an improvement is claimed, the variances of the modifications are 
sza _ (0.21)? 


s2 (0.15)? 


placed in the denominator. Method 1 gives Fı = 


= 1.96, and 


seq _ (0.21? 
s2 (0.12? 
and the number of degrees of freedom for the numerator can be taken as infinite, 
yielding a critical F value of 2.30 from Table 8.8. Thus, the F value for the first 
method is less than this critical value, and the null hypothesis is accepted at the 
95% probability level (that is, Method 1 has the same precision as the original). 
However, the second method does appear to have a significantly greater preci- 
sion. 


Method 2 gives F> = = 3.06. For the standard procedure, s > o, 


a “ee ee 
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s? (0.15)? 
s2 (0.12? 
value of 2.69 (for 12 degrees of freedom in the numerator and denominator of 
Table 8.8), indicating no significant difference between the two methods. 


(b) On the other hand, the ratio F} = = 1.56 is lower than the critical 


Analysis of variance (ANOVA) 


In the previous section, two populations were compared for an estimation of the pre- 
cision between two methods with an F statistical test. For populations greater than 
two, an analysis of variance (ANOVA) can be used to examine differences among 
group means. This test provides a way to assess if each mean likely came from a 
larger overall population. The ANOVA test evaluates a variance ratio. As depicted in 
Fig. 8.4, this ratio considers the distances from the sampled distribution means to the 
“overall” mean (that is, the variance between) relative to the internal spread around 
each mean (the variance within). The ratio of these two quantities is then compared 
to a critical F statistic (see Table 8.8). This statistic allows a test of the null hy- 
pothesis assuming the means (u) are from a larger population where, for example, 
Ao: 1 = u2 = 3. If the ratio of the variances for the between to within values is 
significantly greater than Ferir, then one rejects the null hypothesis Hp. As such, this 
would indicate that at least one mean is an outlier and each distribution is narrow and 
distinct from one another. On the other hand, if this ratio is much less than F;,;;, then 
one fails to reject Ho, indicating that the means are very close to the overall mean 
and/or the distributions overlap. 


Variance Between + Variance Within = Total Variance 
Overall mean 


Variance Between 


Internal 
spread 


Variance Within 


Distance from overall 
mean 


FIGURE 8.4 
Schematic of an ANOVA analysis (adapted from Foltz, 2020). 


Following Foltz (2020), a hand calculation for data analysis of three sample meth- 
ods is shown in Example 8.2.9. This analysis is easily duplicated using the ANOVA 
statistical function in Excel, as demonstrated in Example 8.2.10. 


Example 8.2.9. Consider three methods developed for a chemical analysis. Seven 
observations are made for each method with the following concentrations (in ppm): 
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(i) Method 1: 8.2, 9.3, 6.1, 7.4, 6.9, 7.0, 5.3; (Gii) Method 2: 7.1, 6.2, 8.5, 9.4, 7.8, 6.6, 
7.1; and (iii) Method 3: 6.4, 7.3, 8.7, 9.1, 5.6, 7.8, 8.7. Perform an ANOVA analysis 
to determine if there is any significant difference among the methods. 


A B C D E F G H 
Method 1 Method 3 

8.2 7.2 64 

3.3 6.2 73 

61 8s 87 

74 $4 93.1 

6.9 78 5.6 

7.0 66 78 

5.3 7.2 8.7 

Measured Concentrations (pom) 
717 7.53 1.4 745 
Mean 1 Meon 2 Mean3 Overall Mecon 


eCnank wn = 


fotal no. of observations (N): 
O. of groups/methods (C): 
0. of observations in each group (n): 


(a) Degrees of Freedom (of) (c) Mean Sum of Squares (MS) (A) F test 
Soman C-1 . VAR.S[A10:C10)*(C-1}*n [MSC* — SSC/CF genes MSC/MSE 
VAR.S(A2:A8)*(f-2}> 
VAR.S{B2:88)"(f-2}+ 
VAR.S{C2:C3)*(-1) = a=  FANV.RT(0.05, dha. Cann) 


SSC+SSE 


FIGURE 8.5 


Example calculation of an ANOVA. 


Solution: 

In panel A of Fig. 8.5, the three separate data sets (C = 3) are input into an Excel 
spreadsheet, where the means are evaluated accordingly. Since each group has the 
same number of data points (n = 7), the “overall” mean can be taken either by av- 
eraging the resultant averages for each of the three groups or by averaging all of the 
observations (with N = 21). An ANOVA assessment is performed below the data sets 
for the two variation sources for the “between” and “within” groups. In this analy- 
sis, the number of degrees of freedom (df), the sum of squares (SS), and the mean 
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sum of squares (M S) are evaluated with the formulas given in the footnote entries. 
Here the “C” and “E” designations (that is, SSC and SSE for the sum of squares and 
MSC and MSE for the mean sum of squares) refer to a “column (C)” or “error (E)” 
quantity, respectively, for the two sources of variance groups. 


To simplify the calculation for the sum of squares, the Excel variance function 
(VAR.S) is employed (which is the square of the standard deviation in Eq. (8.4)). To 
recover the sum of squares, this function must be multiplied by appropriate factors 
for the degrees of freedom for the between and within groups. The mean sum of 
squares (MS) for a given group is simply the sum of squares values (SS) divided 
by the number of degrees of freedom (df). Finally, the F parameter is a ratio of the 
sources variances for the between and within groups for the mean sum of squares (that 
is, F = MSC/MSE). The F value is then tested against a critical value Frit. This 
latter quantity can be obtained either from Table 8.8 or using the Excel function at a 
95% confidence level KINV.RT(0.05, df for the numerator, df for the denominator). 
In this example, since F < Ferit, one fails to reject the null hypothesis, indicating no 
significant difference in the means for the three methods. 

In contrast to a one-way analysis, a two-way ANOVA test can be further per- 
formed if subgroups exist in the data sampling. For example, if observations are made 
at two different temperatures as well in each chemical process, this additional type of 
test could be employed. An F test simply indicates a significant difference between 
the groups but it does not tell where the differences lie. One must then perform a post 
hoc test. Such analyses may include a Bonferroni correction, a Tukey honest signif- 
icant difference test, or a least significant difference test. The post hoc tests explore 
the differences between the groups while controlling the experimental error rate. 


Example 8.2.10. Perform an analysis for the data in Example 8.2.9 using the Excel 
ANOVA statistical function. 


Solution: 

In Excel, select the “Data” tab in the upper ribbon and choose “Data Analysis” 
at the top right corner of the worksheet. A “Data Analysis” pop-up box appears. 
In Fig. 8.6(a), select the “Anova: Single Factor” entry for a one-way ANOVA 
analysis in “Analysis Tools” and click the “OK” button. Another pop-up box as 
Fig. 8.6(b) appears. Select the entire input range of data including the headings from 
$A$1:$C$8 (the heading will be reflected accordingly in the Excel output table). 
Choose “Grouped By: Columns” and check the “Labels in the First Row.” Input an 
“Alpha” value of 0.05 (for a 95% confidence level for the Ferit value). Select the cell 
location for output in the “Output Range” entry and click “OK.” The Excel output ta- 
ble in panel B of Fig. 8.5 results. These results match precisely the calculated results 
in panel A. This latter methodology provides a simple way to perform an analysis 
with the Excel ANOVA function. 
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Data Analysis ? x 
Analysis Tools 
Anova: Two-Factor With Replication Cancel 
Anova: Two-Factor Without Replication 
Correlation 
Covariance Help 
Descriptive Statistics 
Exponential Smoothing 
F-Test Two-Sample for Variances (a) 
Fourier Analysis 
Histogram v 
Anova: Single Factor ? 
Input 
Input Range: $A$1:$C$8 * 
Cancel 
Grouped By: © Columns 


O Bows 
V) Labels in first row 


Alpha: 005 


Ope E $a$26 t 
New Worksheet Ply: (b) 
New Workbook 


FIGURE 8.6 


Sample calculation using the ANOVA Excel function (a) with the Data Analysis dialog for (b) 
a one-way analysis. 


8.3 Regression analysis and software applications 


In the derivation of a calibration curve, for example, it may be necessary to derive 
a “best” straight line through a given set of data points (which only contain ran- 
dom error). This type of regression analyses is handled through scientific software 
packages such as SigmaPlot and SigmaStat (Systat Software, Inc., 2018), as well as 
Excel. All packages easily handle linear regression and some nonlinear functions. 
SigmaPlot also uses a more general model fitting algorithm for nonlinear fittings us- 
ing a Marquardt—Levenberg algorithm (see Press et al., 1986). 

In a linear regression analysis, it is assumed that there is a linear relation between 
the dependent variable, y, and the independent variable, x: 


y=at be. (8.15) 


The task at hand is therefore to determine the intercept, a, and the slope, b, for the 
best-fit straight line through the scattered measured data. As shown in Fig. 8.7, the 
residuals, which represent the vertical deviations (d) between the experimental data 


and the least-squares line, must be minimized. This type of fitting is obtained with 
the following equations: 


AB z - 
b= -g anda=y— bx, (8.16) 


re 
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Predicted value 
| d, (deviation or residual) 


x 
Observed value 


ag Intercept 


xy 


FIGURE 8.7 


Linear regression analysis. 


where the regression quantities are defined as 
AB=) i -5) Qi- 5) = J ziyi — — (8.17a) 
2 = (xj —#? = Dix? - (Ea) = ox? —ni, (8.17b) 
2= Soi - wy => oy - ox) =) y-n). (8.17c) 


Here, x; and y; are individual pairs of data for the n data points so that for the sum- 
mations i goes from i = | to i =n. The quantities x and y are the average values of 
the variables: 


l and y= S. (8.18) 


The variation of data about the regression line can be quantified in terms of the stan- 
dard error of estimate: 


Se = 


.1 
n—2 n—2 ee) 


Eoy / B2 — b2 A? 

The quantity (y; — y) in the first term of Eq. (8.19) is the vertical deviation d; for 
point i in Fig. 8.7. The number of degrees of freedom in Eq. (8.19) is two less than 
the number of points because one degree of freedom is lost in calculating a and one 
for b. The quantity Se can be used to predict the confidence intervals for the regression 
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line estimate. In particular, the standard error for the slope b is given by 


as ey = F, (8.20) 


The confidence limit for the slope can be derived using sp in Eq. (8.20) such that 


confidence limit for slope = b + tsp. (8.21) 


The prediction interval for y (at a given value of x*), using se in Eq. (8.19), is given 
by 


Gan) 


x (8.22) 


prediction interval for y = y + sef 1+ 1 + 
Both Eq. (8.21) and Eq. (8.22) are evaluated with a ¢t value for a desired confidence 
level and n — 2 degrees of freedom (Table 8.6). 

A further quantity of interest is the correlation coefficient, which determines the 
strength and magnitude of the relationship between the variables x and y. The symbol 
r is used for the correlation coefficient for the data sample (and p for the population). 
The most popular one is the Pearson product moment correlation coefficient, defined 
as 


nÐ xivi — oxi yi AB 
{baa J (Exi)"] [ Do 20 ad 


The range of r is from —1.0 to +1.0. A strong negative linear relationship exists 
when r is near —1, and a strong positive relationship exists when r is close to +1. 
Therefore, the closer |r| (or r°) is to 1, the more confident one is that a straight line 
represents the data. 


r= 


(8.23) 


Example 8.3.1. The first two columns of Table 8.9 contain the experimental data 
for the calibration curve for the peak area (y) versus the concentration of isooctane 
(in mol%) (x) based on a chromatographic analysis. Perform a linear least squares 
analysis of the data. 


Solution: 

Columns 3, 4, and 5 of Table 8.9 contain the computed values for (x;)?, ODP, and 
x;y; (and their sums). Thus, Eq. (8.17a), Eq. (8.17b), and Eq. (8.17c) yield AB = 
15.81992 — (5.365 x 12.51 / 5) = 2.39669; A? = 6.90201 — (5.365)? / 5 = 1.14537; 
and B? = 36.3775 — (12.51)? / 5 = 5.07748. Substitution of these values into 
Eq. (8.16) and Eq. (8.18) gives the regression coefficients b = 2.39669 / 1.14537 = 
2.0925 = 2.093 and a = (12.51 / 5) — (2.0925 x 5.365 / 5) = 0.2567 = 0.257. The 
equation for the straight fitted line from Eq. (8.15) is therefore y fit = 0.257 + 2.093x. 


ed 
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Table 8.9 Calibration data for a chromatographic method. 


Mol% Peak (xi)? (i)? xi yi 
isooctane, x; area, yi 

0.352 1.09 0.12390 1.1881 0.38368 
0.803 1.78 0.64481 3.1684 1.42934 
1.08 2.60 1.16640 6.7600 2.80800 
1.38 3.03 1.90440 9.1809 4.18140 
1.75 4.01 3.06250 16.0801 7.01750 
5.365 12.51 6.90201 36.3775 15.81992 


Using Eq. (8.19) and Eq. (8.20), the standard errors for the estimate and slope are, 
respectively, 


5.07748 — (2.0925)? x 1.14537 (0.144)? 
= = 0.144 and sp = | ———~ = 0.135. 
me / 5—2 and se = V 714537 


The confidence limit for the slope follows from Eq. (8.21) and Table 8.6, where for 
a 95% confidence level and 5 — 2 = 3 degrees of freedom: 95% confidence limit of 
slope = 2.093 + (3.18 x 0.135) = 2.093 + 0.429. 


Similarly, using Eq. (8.15) and Eq. (8.22), the prediction interval for y at a 95% 
confidence level, at the point x* = x; = 0.352 (Table 8.9), is 


(0.2567 + 2.093 x 0.352) + 3.18 x 0.144 1+ : + (5.365/5 0.352)? 
d : x U. +35.16 xÜ. x 
5 1.14537 


= 0.993 + 0.590. 


2.39669 
The correlation coefficient is calculated as r = = 0.994 or 
/1.14537 x 5.07748 


(r? = 0.988). This value of r indicates a strong (positive) linear relationship. These 
labor-intensive calculations can also be performed easily in Excel (see Fig. 8.8). The 
95% confidence intervals for both the population (that is, prediction interval) and the 
regression line are shown in Fig. 8.9. These results are obtained with a spreadsheet 
analysis using the Excel top menu tab “Formulas” — “More Functions” — “Statis- 
tical.’ The two function choices are employed in the drop down menu: “LINEST” 
(for the regression analysis) and T.INV (for the value of the Student ¢ distribution at 
a 95% probability, that is, œ = 0.05). 

For the LINEST calculation, one selects the cells for output. The input for the di- 
alog boxes of this function are: y-column values; x-column values; TRUE (indicates 
a line of the form y = a + bx with a nonzero intercept); and TRUE (to list the esti- 
mates). After the dialog boxes are filled, one holds down the three keys “control + 
shift + enter” and results automatically fill the selected area. The output includes the 
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A D E F 
1 Mol% 
2 x 
3 (Calc 1: eq(8.15)) 
4 
=a 
6 | 
7 
8 [Cale 1: Yiesrewion for C3 =$BS11*A3+SCS11 
9 
10 | Regression (=LINEST(B3:87,A3:A7, TRUE, TRUE), then type: cntrl+shift+enter) 
11 | 2.093 0.257] a [tēmnvoos, ca) | 3.48] 
12 | oa35f oss| s, [Ab sFs1798$12) | 0.429] 
13 oase) ora s [esms | osoa 
14 |F-statistic af a| DoF [Fem RINV.RTIO.05,2,C14)]| 10.13] 
15 [regression SS | 5.015]  0.0624| residuals | 
16 
17 Average x 1.073 
18 |A? 1.14537 
19 
20 
21 Veet AY 
22 | (Calc 2 - see text)| 0.371 1.364 
23 0.236 2:173 
24 | 
25 
26 | 0.356 4.274 3.563 
27 |Calc 2: Ay for B22=$F$11*$C$13*SQRT((1/($C$14+2))+(($B$17-A3)^2)/$B$18) 
28 
29 
30 | Ay Ym + AY Ym “AY 
31 | (Calc 3: eq(8.22))} 0.590 1.584 0.403 
32 | 
33 
34 
35 | 
36 |Calc 3: Dy for B831=SFS11*$C$13*SQRT((1+1/($C$14+2)}+(($B$17-A3)*2)/SBS18) 
37 | 
FIGURE 8.8 


Excel spreadsheet analysis for linear regression. 


slope (b) and intercept (a) of the fitted regression line and the standard errors of the 
latter parameters (Ab and Aa), as well as the standard error for the estimate (se). The 
number of degrees of freedom, the F-statistic, the regression sum of squares, and the 
residual sum of squares are further tabulated. 
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Data 
——— regression line 


= — —95% Prediction Interval (Lower) 


Peak Area, y 


= — — 95% Prediction Interval (Upper) 
sessssses 95% Confidence Interval (Upper) 
sessssses 95% Confidence Interval (Lower) 


0 0.5 1 15 2 


Mole % Isooctane, x 


FIGURE 8.9 


Graph of calibration curve with confidence and prediction intervals shown. 


As mentioned, the r°? value is generally used as a popular indicator of the good- 
ness of fit. This quantity can be directly estimated from the total sum of squares (SS) 
and the regression sum of squares (SS): 


n 
X Oii -5 


2 regression SS ; 


total SS de a 
Xo- 
i 


where y; fit is the predicted value of y for a given value of x; using the equation 
for the fitted regression line. Although a value close to one indicates a good fit, an 
even better statistical test of the goodness of fit is the Fisher F-statistic. This statistic 
involves the ratio of the variances (its value is given in Fig. 8.8): 


3 


=2 
_ variance explained regression SS / vı _ (ou.si =y) ) / vi 
~ variance unexplained residual SS / v2 (Xo =" fin?) 7 M; 


The statistic is used under the null hypothesis that the data are a random scatter of 
points with a zero slope. The critical value of the F-statistic can also be calculated 
with Excel. If the F-statistic > Ferit, the null hypothesis fails and the linear model 
is significant. For the degrees of freedom vı = 1 and v2 = n — k, where k is the 
number of variables in the fitted regression analysis including the intercept and n is 
the number of data points. The value for v2 is the number of degrees of freedom. 
For this example, vı = 1 and v2 = 3, so that the Ferit value is evaluated as 10.13, as 
shown in Fig. 8.8 using the Excel function “RINT-RT” (with œ = 0.05, vı = 1, and 
v2 = number of degrees of freedom). The F-statistic in the example is 241, which is 
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much greater than Ferit, so that one is 95% sure that the data are not a random scatter 
of points and that the linear regression is justified. 

The confidence interval for the regression line is calculated with Eq. (8.22) but 
neglecting the factor of unity in the square-root expression. The intervals in Fig. 8.9 
become wider as the value of x is further from x. In the Excel analysis, the stan- 
dard errors for the slope and intercept are also available, as listed as sp and sq in 
Fig. 8.8. As expected, the values of the regression coefficients calculated above for 
a,b, Se, Sp,r2, and confidence intervals agree perfectly with the Excel analysis re- 
sults. 


8.4 Propagation of errors 


Numerical problems are important in engineering for solving problems on large (or 
small) computers where errors can occur and propagate. 


Significant digits and error 
Most digital computers represent numbers in either a fixed system (that is, a fixed 
number of decimal places such as 62.358, 0.013, 1.00, and so on) or a floating point 


system (that is, a fixed number of significant digits such as 0.6238 x 10°, 0.1714 x 
1073, —0.2000 x 10!, etc.). The significant digit of a number c is any digit of c, 
except possibly for zeros to the left of the first nonzero digit that serves only to fix the 
position of the decimal point. For example, 1360, 1.360, and 0.001360 all have four 
significant digits. 


An error will be caused by chopping (i.e., discarding all decimals from some 
decimal on) or rounding. A good rule of thumb for rounding is to discard the (k + 1)th 
decimal and all subsequent decimals as follows: 


(a) If the number thus discarded is less than half a unit in the kth place, then leave 
the kth decimal unchanged (“rounding down”). 

(b) If it is greater than half a unit, add one to the kth decimal (“rounding up”). 

(c) If it is exactly half a unit, round off to the nearest even decimal. For example, 
rounding 3.45 and 3.55 to one decimal gives 3.4 and 3.6, respectively. 


Rule (c) ensures that in discarding exactly half a decimal, rounding up and down typ- 
ically happen equally on average. For technical simplicity, computers use rounding 
(by rounding up in case (a)), or in some case chopping. 


The final results of computations are approximations because of round-off (or 
chopping) errors, experimental errors, or truncation errors (for example, truncating 
an infinite series). If @ is an approximate value of a quantity whose exact value is a, 
the error is simply the difference 
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or a = & + € (i.e., the true value equals the approximate value plus error). The 
relative error €, of & is defined by 


€ a-a error 


(a#0). 


a a true value 


es : ; E ; A 
If |e] << |a|, the relative error can be approximated by e, ~ =. In practice € is 


a 


unknown; however, one can get an error band for a such that |e| < 6; hence, |a—a| < 


B. Similarly, for the relative error, |€,| < +, and hence = <fr. 


8.4.1 Error propagation 


Errors in a computation can propagate and affect the accuracy of the result. 


(a) In addition and subtraction, an error bound for the results is given by the sum of 
the error bounds for the terms. 

Derivation: Let x =x + €1, y = Y + €2, where |€1| < £1 and |e2| < 62. Then the 
error difference is |e] = |x — y — (& — y)| = le1 — €2| < |e1| + le2l < Bi + £2. 
The proof for the sum is similar. 

In multiplication and division, an error bound for the relative error of the results 
is given (approximately) by the sum of the error bounds for the relative errors of 
the given numbers. 

Derivation: For the relative error €, of xy one gets from the relative errors €,1 
and €,2 of x, y and bounds 6,1 and 8,2: 


(b 


~~ 


ei xy—xy|_ |xy—-(@—e1)(y—€2)| (ey + ex — sre 
pay | xy > xy 
€1y + €2x 
= a = |er1 + €r2| < Bri + r2- 


The proof for the quotient is similar. 


Accumulation of determinate errors 


For the sum or difference y = a + b — c, where a, b, and c are measurable quantities, 
the absolute determinate errors, Aa, Ab, and Ac in these quantities, propagate as 
Ay = Aa + Ab — Ac. 


Example 8.4.1. Calculate the error in the result of the following calculation: y = 
+0.50(+0.02) + 4.10(—0.03) — 1.97(—0.05) = 2.63(Ay), where the numbers in 
parentheses are the absolute determinate errors. 


Solution: 
The absolute error is Ay = 0.02 + (—0.03) — (—0.05) = +0.04. 
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For the product or quotient y = ab/c, where a, b and c are measurable quantities, 
the absolute determinate errors, Aa, Ab, and Ac in these quantities, propagate in ac- 
cordance with the following relative error expression: (Ay/y) = (Aa/a)+ (Ab/b) — 
(Ac/c). 


Example 8.4.2. Calculate the error in the result of the following calculation, where 
the number in parentheses are the absolute determinate errors: 


__ 4.10(—0.02) x 0.0050(+0.0001) 
ae 1.97(—0.04) 


=0.01041(Ay). 


Solution: 
Ay _ —0.02 0.0001 


The error must be based on relative errors such that + 
4.10 0.0050 
—0.04 — 0.035 
197 00 


To obtain the absolute error in y, Ay = 0.035 x y = 0.035 x 0.01041 = 0.0004. Thus, 
the final result is y = 0.0104 (+0.0004). 


Accumulation of indeterminate errors 


In contrast to a determinate error no sign can be attached to a standard deviation 
(since this error is random) and therefore has an equal probability of being either 
positive or negative. Since these errors are independent (and random), they may can- 
cel one another. In the case of sums or differences, the errors therefore propagate in 
quadrature (that is, as the sum of individual absolute variances). Thus, for the quan- 
tity y =a + b — c, with standard deviations +sa, +sp, and +sc, the errors propagate 


as 
sy = s2 +52 +52. (8.24) 


Similarly, for a product or quotient y = ab/c, the relative variance of the result is 
equal to the sum of the individual relative variances so that 


(sy) = Dr + 6D + Dr. (8.25) 


where the relative error is the standard deviation normalized by the given quantity 
(for example, (Sq); = Sq/a). 


Example 8.4.3. Calculate the standard deviation for the following calculation: 


[14.3(+0.2) — 11.6(40.2)] x 0.050(-0.001) e 
y= = 1.725(+sy) x 107°. 
[820 (10) + 1030(£5)] x 42.3(£0.4) 


Solution: 
First one must calculate the standard deviation of the sum and difference. For the 
difference in the numerator, sa = (40.2)? + (£0.2)? = +0.28, and for the sum 


in the denominator, sy = y (£10)? + (+5)? = £11. The equation may be rewritten 
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[2.7(+0.28)] x 0.050(+0.001) 


[1850(+11)] x 42.3(+0.4) 
and quotients, the relative standard deviations of the individual quantities must be 


as y= . Since the equation only contains products 


determined: 
+0.28 +0.001 
oe L 0.104, — = +0.020, 
Csa) = 5 (so) = 9 950 
Ce] 000, and Gp = 5.000 
Sc r= 7950 . >» an Sd r= p3 °° , 


so that from Eq. (8.25): 


(sy); = V (40.104)? + (40.020)? + (+0.0060)2 + (0.0095)? = 0.106. 


The absolute standard deviation of the result is sy = 1.725 x 1076 x (40.106) = 
+0.18 x 1076. The final result is therefore 1.7(+0.2) x 1076. 


If y is a function of one variable, y(x), then 


d 
ôy = 2 ôx. (8.26) 
d 


Substituting in the standard deviations sy and sx for ôy and ôx, respectively, gives 


dy 


78: (8.27) 


=| 


Example 8.4.4. Suppose 6 = (7/3) + 0.052 rad. The best estimate of cos(x /3) = 
0.50. What is the uncertainty in cos(@)? 


Solution: 
Using Eq. (8.27), the uncertainty is Scoso = |d(cos@)/d@|sg = |sin@|sg = 
| sin(x /3)| x 0.052 = +0.045. Therefore, the final result is cos 0 = 0.50 + 0.05. 


Using the general relation in Eq. (8.27), the indeterminate error for exponential 
calculations, y = x”, is 


n—-1 


Sx => (Sy)r = |n| (Sx)r. (8.28) 


ny 
Sy = |nx Sy = |— 
° x 


Similarly, using Eq. (8.27) for the logarithmic expression, y = log x = 0.434 1n x, the 
standard deviation is derived as 


Sy = 0.434(s,)>. (8.29) 


Example 8.4.5. The standard deviation in measuring the diameter d of a sphere is 
+0.02 cm. What is the standard deviation in the volume if d = 2.15 cm? 


Solution: 
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4 (aN? 4 725" 3 , sv 
We have V = -x = -T = 5.20 cm”. Hence, using Eq. (8.28), — = 
3 A2 3 2 V 
Sd : 


3x Pi =3x ZI5 = 0.028. The absolute standard deviation in V is then sy = 5.20 x 


0.028 = 0.15. Therefore, V = 5.2 (+0.2) cm’. 


Example 8.4.6. Calculate the absolute standard deviations in (a) y 
log[2.00(+0.02) x 107°] = —2.6990+? and (b) x = alog[1.200(+0.003)] 


15.849+2. 
Solution: 
0.02 x 1073 
a) Using Ea. (8.29), sy = +0.434 x ————_ = +0.004. Thus, 
se Using Ma (Geta) * 3.00 x 10-3 E 


log[2.00(+0.02) x 1073] = —2.699 (+0.004). 


: ; +0.003 
(b) Rearranging Eq. (8.29), (sx); = a 5y 


= = = rv. . Th N 
c 044 a 


Sx = +0.0069 x x = +0.0069 x 15.849 = 0.11. 


Therefore, alog[1.200(+0.003)] = 15.8+ 0.1. 


Finally, in general, if y is any function of several variables x, ..., z, then for inde- 
pendent random variables, 


dy \? dy. \7 dy y ay \? 
ôy = —ôx | +... +{ —8z] >sy= —sy) +...+( =s: ] . (8.30) 
ax Oz i ax Oz 


It can be seen that Eq. (8.24) and Eq. (8.25) are simply special cases, which follow 
from Eq. (8.30). 


Total uncertainty 

For determinate errors (that is, a systematic component, dxsys;) and indeterminate 
errors (that is, a random component, 5x;andom), these quantities can be combined in 
quadrature (since they are independent) to yield an overall error: 


ôy =  xrandom)? =F (5xsys)°- (8.31) 


Although 6x;gndom can be reduced with a larger number of measurements (see 
Eq. (8.9) and Eq. (8.12)), the overall uncertainty ôx is ultimately limited by ôxsys. 
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Problems 


8.1 Given the function y = ab/c, where a, b, and c are independent variables with 
random errors of Sa, Sp, and sc, show that using the general error propagation formula 


one obtains the specific result (sy), = "i (sa)? + (sp)? + (s,)?. 


8.2 What is the expression for calculating the standard error of the intercept (yo) of 
a linear regression line? Hint: Use the confidence interval relation to determine this 
expression. 


8.3 For the logarithmic expression y = log x, show that the standard deviation sy is 
given by sy = 0.434(s,),. 

8.4 A concentration measurement (in ppm) was made consisting of the following 
values: 21, 15, 23, 21, and 24. Should the anomalously low value be refused based 
on a Q critical value at a 90% confidence? 


8.5 What is the difference between at distribution and a normal distribution as used 
for confidence interval prediction? 


CHAPTER 


Numerical analysis 


In engineering, one may be faced with a need to find the zero of a function or for 
numerical interpolation, spline fitting, or smoothing of acquired data. In addition, 
there may be a further requirement for numerical integration and differentiation. 

One of the most important tasks in engineering is optimization and approximation 
theory. This may involve the approximation of an equation with a solution at discrete 
points (see Chapters 6 and 7) or for the solution of a nonlinear system. The determina- 
tion of roots may arise in the finding of an optimal solution to a real-life problem. It is 
also important to find the zeros or poles of a transfer function in control theory or for 
the solution of differential equations using transform methods in Chapter 3. Numeri- 
cal interpolation and spline fitting provides a means to interpolate numerical listings 
in mathematical tables or for the interpolation of acquired data obtained in design and 
testing experiments. Lagrange interpolation polynomials allow one to develop shape 
functions in finite element methods (Chapter 7) or to assess the error in numerical in- 
tegration formulas where an analytical solution for the integral does not exist. It can 
be used for assessment of distorted geometries when loads are applied to deformable 
bodies. Spline fitting is particularly important to avoid oscillatory behavior in the de- 
velopment of models, which can arise when fitting higher-order polynomials to the 
data. Data smoothing may also be required for improved data analysis, where there 
may be a requirement to filter noisy data from measurements that have an inherent 
uncertainty due to difficult sampling procedures. 

Numerical differentiation is at the heart of numerical methods for solution of or- 
dinary and partial differential equations in the modeling of physical behavior (see 
Chapter 6). For instance, this approach is particularly needed for the solution of 
complex engineering problems that can arise, where the coefficients representing the 
thermal and physical properties of the materials in the underlying equations may 
be a function of the dependent-solution variable itself (e.g., with a dependence of 
the thermal conductivity on temperature in a heat conduction problem), resulting in 
a nonlinear problem. Numerical techniques are particularly adaptable to computer 
programs or algorithms. Computer-aided engineering relies on numerical analysis 
methods for the modeling of dynamical systems and numerical simulation of cou- 
pled differential and algebraic systems for real-time applications. 

Techniques for the solution of these various problems are described in this chapter. 
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9.1 Finding zeros of functions 


A solution of the equation 


f(x) =0 (9.1) 


involves finding a number x = s such that f(s) = 0. Equation (9.1) can con- 
sist of an algebraic equation (e.g., a polynomial such as x? — 3x + 2 = 0) ora 
transcendental equation (e.g., tanx = x). For a polynomial or transcendental equa- 
tion, the solution of Eq. (9.1) is called the roots of the equation. 


As seen previously, equations of the form of Eq. (9.1) may occur. For example, such 
problems arise in the solutions of characteristic equations and the zeros of Bessel 
functions, to name a few. Formulas that exist to give exact numerical values of the 
solution only exist in simple situations (e.g., solution of quadratic equations), and in 
most cases, an approximate iterative method is required. 


9.1.1 Fixed point iteration 
Equation (9.1) can be transformed algebraically into the form 


x = g(x). (9.2) 


Thus, choosing a value of xg, implies that x; = g (xo), x2 = g (x1), ..., and in general, 


Xn41=8(Xn)| (n=0,1,...). (9.3) 


From Eq. (9.1), several different forms of Eq. (9.2) can arise, and the behavior 
of the corresponding iterative sequences xg, x1, ... may differ accordingly. The iter- 
ative process in Eq. (9.3) is convergent for xo if the sequence xo, x1, ... converges. 
A sufficient condition for convergence for any xo in the interval J is that if x = s is 
a solution of x = g(x) and g has a continuous derivative in J (containing s), then 
lg'(x)|< K <1linJ. 


Example 9.1.1. Find a solution of f (x) = x? — 4x + 2 = 0 by iteration. 


Solution: 
The roots of a general quadratic equation ax? + bx + c can be obtained analytically 
from 
—b + Vb? — 4ac 
X12 = ——— n. 
2a 
Hence, inserting a = 1, b = —4, and c = 2 into this solution equation, the two roots 


are evaluated analytically as xj 7 = 2 + /2, yielding the values xı = 0.5858 and 
x2 = 3.414. 
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For the numerical solution, the equation may be written as 
1/4 lfa 
riage +2), thus xa+1 = 3 (x2 +2). 


1 
Then |gi(x)| = =|x| < 1 for any x < 2. Choosing xp = 1, one obtains xo = 


1.0000, x; = 0.7500, x2 = 0.6406, x3 = 0.6026, x4 = 0.5908..., which is approach- 
ing the exact solution for the first root of 0.5858. [answer] 


Note that if one chose x9 = 4, the sequence diverges as expected from the above 
theorem, that is, x9 = 4.000, xı = 4.500, x2 = 5.563, x3 = 8.235, x4 = 17.455.... The 
equation may also be written as 


2 
x= g:(x)=4— —,  thusx,4,; =4-—. 
X n 


2 
Then |g4(x)| = a 1 for x > J2. Thus, now choosing x9 = 4, one obtains the 


x 
sequence xo = 4.000, xı = 3.500, x2 = 3.429, x3 = 3.417, x4 = 3.415..., which con- 
verges to the second root at 3.414. [answer] 


9.1.2 Newton’s method 
This method is commonly used because of its simplicity and speed. In this method, 


one approximates the graph of f by suitable tangents (see Fig. 9.1). 


4 


y 
| y=f(x) 


fixe) 


& 
x y 


FIGURE 9.1 


Evaluation of f(x) = 0 using the Newton method. 


One starts the iteration process with an approximate value xo, from which a value is 
determined on the graph for f. Next using a tangent line at this point to the point of 
intersection on the x axis, one obtains the next value xı. The tangent to the curve of 
f at xo is evaluated from 


tanB = f’ (xo) = Lo) | and hence x; = xo — firo) : 
Xo — X1 f'(xo) 
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This process continues as the improved iterative values for the root approach the 
value at s using the iterative procedure: 


f&n) 
i) 


(9.4) 


Xn+1 = Xn — 


Example 9.1.2. Find the positive solution of x? — 4x + 2 = 0 by Newton’s method. 


Solution. 
Setting f(x) = x? —4x+4+2=0, f'(x) = 2x — 4 and Eq. (9.4) gives 


xp — 4x, +2 _ x22 
2xn—4 a, ee 


Xn+1 = Xn 


The solution is again near xọ = 1. Therefore, successive iterations yield xọ = 
1.00000, xı = 0.5000, x2 = 0.5833, x3 = 0.5858. The value of x3 is identical to four 
decimal places to the analytical solution in Example 9.1.1. [answer] 


Order of the method (speed of convergence) 


Given an iteration method xn+1 = g (xn), where x, is an approximate to the solution 
s, that is, s = Xn + €n (or Xn — S = —€n ), if g is differentiable many times, the Taylor 
formula gives 


1 
Xnt1 = 8 (2n) = B(8) +.8'(9) Gn = 8) + 58") in = 8)" + 
1 1 ” 2 
= g(s) — g (S)€n + 38 (sje; +... (9.5) 
The order of the iteration process is the exponent of en in the first nonvanishing term 


after g(s) (see the example below for the Newton method). For instance, subtract 
g(s) = s on both sides of Eq. (9.5): 


1 
Xn+1 — S = Ent = —8'(5)En + 58" (Sep (9.6) 
In Newton’s method, g(x) = x — f) and 
Fœ) 
; FO =f@OS"@) _ FOF") 
=] = 9.7 
ae Fay Fay aa 
Since f(s) =0, Eq. (9.7) gives g'(s) = 0. Differentiating Eq. (9.7) again and setting 
x = s yields 
g") = a (9.8) 
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Thus, Eq. (9.6) and Eq. (9.8) indicate that the Newton method is of second order, that 
is, 


E 
"HIO Ps) 


2 
Ek 


if f(x) is three times differentiable and f’ and f” are not zero at a solution s of 
f(x) = 0. This result means that if €n = 107% in some step, then for the second order, 
€n+1 = constant - 107% so that the number of significant digits is about doubled in 
each step. 


9.1.3 Secant method 
Replacing the derivative f’(x) in Eq. (9.4) by the difference quotient 


fn) — f On-1) 


Xn — Xn-1 


f'n) © 


yields the secant method: 


Xn — Xn-1 


—— 9.9 
f Xn) — f&n-1) Gel 


Xn+1 = Xn — f (Xn) 


A geometric interpretation is shown in Fig. 9.2. 


A 


y 
f 
— y=fix) 
secant | fX) 
wey 
— Ø = > 
S Xn+1 Xn Xn-1 x 


FIGURE 9.2 


Evaluation of f(x) =0 using the secant method. 


Example 9.1.3. Solve f(x) = x? — 4x + 2 = 0 by the secant method with x9 = 1 
and x; = 0.5. 


Solution. 
By Eq. (9.9): 


(x2 — 4xn + 2)(Xn — Xn—1) (x2 — 4xn +2) (Xn — Xn—1) 
Xn+1 = Xn z 7 = Xn 2 7 : 
Xq —4Xn +2 — x7) +4%n-1 — 2 Kg = Xg T 4n — Xn-1) 
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The numerical results are x9 = 1.0000, x; = 0.5000, x2 = 0.6000, x3 = 0.5862, x4 = 
0.5858. [answer] 


As in Example 9.1.2, x4 = 0.5858 is exact to four decimal places. 


9.2 Interpolation 


Interpolation is required to find (approximate) values of a function f(x) for an x be- 
tween different x values (x9, x1, ..., Xn) at which the values of f(x) are known (for 
example, as in mathematical tables or with recorded data). An interpolation polyno- 
mial py, (x) of degree n can be found that assumes the given values 


Pn(X0) = fo, Pn(*1) = fi, «5 Pan) = fh, (9.10) 


where fn = f (Xn). 


9.2.1 Lagrange interpolation 


In Lagrange’s form, the polynomial p, (x) is given by 


K(X) s 
Ik (xk yi 


fO)® pale) = OW fee 2 (9.11) 


k=0 


In Eq. (9.11), L(x) = 1 and Lg (x;) = 0 for j Ak. Thus, lg(x) is defined by 
lolx) = (x — x1)(x — x2)...x — Xn), 
Ig (x) = (x — X9)...(% — xR) (X — Xk+1).-- (x — Xn), O<k <n, (9.12) 


In(x) = x — x0) — 1)... — Xn-1). 


Inspection of Eq. (9.12) shows that J, (xj) = 0 if j #k, so that for x = x, the sum in 
Eq. (9.11) reduces to the single term (lg (xx))/Ck (xk) fk = fk, as required. 


Example 9.2.1. Find the linear Lagrange interpolation polynomial p(x), n = 1, 
from Eq. (9.11) and Eq. (9.12). 


Solution. 
We have 
x—X] x — Xo 
Pix) = Lo) fo + Lix) fi = - fot - fi. [answer] (9.13) 
x0 — x1 x1 — x0 


Example 9.2.2. Compute sinh(1.5) from sinh(1.0) = 1.1752 and sinh(2.0) = 3.6269 
by linear Lagrange interpolation. 


a eee 
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Solution. 
Given xp = 1.0, x; = 2.0, fo = sinh(1.0), and fı = sinh(2.0), from Eq. (9.13) in Ex- 
ample 9.2.1 we have 


inh(1.5) © p,.5) = 8 4.1752) 4 12 3.6269) = 2.4011. | | 
sin. 4 7X ý =e š E S a = 4 R answer 
E1 1.0—2.0 2.0—1.0 


Since sinh(1.5) = 2.1293 (exact to four decimal places), the error is € = a — å = 
2.1293 — 2.4011 = —0.2718. 


The error for any polynomial interpolation method, if f(x) has a continuous 
(n + 1)st derivative, is given by the formula 


FO 


TE (9.14) 


En(X) = f (x) — Pn) = (& — xo) Œ — x1)... — Xn) 


Example 9.2.3. Estimate the error in Example 9.2.2 using Eq. (9.14). 


Solution. 
Given n = 1, f (t) =sinht, f’(t) =cosht, f”(t) = sinht, one has 


€1(x) = (x — 1.0)(x — 2.0)5 (sinh) , 


which is evaluated as €; (1.5) = —0.1250sinht so that: 


t = 1.0 implies a smaller negative value of = —0.1469, 
t = 2.0 implies a larger negative value of = —0.4534. 


Hence, one obtains —0.4534 < €;(1.5) < —0.1469. [answer] 


This calculated range is in agreement with the reported error in Example 9.2.2 of 
—0.2718. 


9.2.2 Newton’s divided difference 


With the Lagrange method, the degree of the interpolation polynomial that will give 
the required accuracy is not known. However, the Newton method provides a means 
to simply add on another term to improve the accuracy. Newton’s divided difference 
formula is 


f (x) fo + (x — xo) fixo, x1] + (x — xo) (x — x1) f[xo, x1, x2]+ 


(9.15) 
we +H (x — xo)...(x — Xn-1) f x0, «5 Xn], 


So —_ eee 
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where 


fixo, x1] = h-hh 
x1 — x0 
nanais flx1, x2] — flxo, ol (0.16) 


X2 — Xo 
fix, wy XK] a fixo, set, HET] 


f[xo, ..., x2] = : 
Xk — X0 


Example 9.2.4 shows how to use a difference table for the evaluation of Eq. (9.15). 


Example 9.2.4. Compute sinh(1.5) from the given values in Table 9.1 using New- 
ton’s divided difference interpolation formula. 


Table 9.1 Example of a Newton divided difference table for interpolation. 


Xj fj = f (xj) S[xj,xj41] 1X7, Xj41,%j42] S [Xj +++) X43] 
0.5 0.5211 
1.3082 
1.0 1.1752 0.7623 
2.4517 0.4829 
2.0 3.6269 1.96972 
6.3911 
3.0 10.018 
t 
Given 
values 
Solution. 


The divided differences are shown in Table 9.1. For example, a sample calculation 
for “*” is (6.3911 — 2.4517)/(3.0 — 1.0) = 1.9697. The values needed in Eq. (9.15) 
are placed in boxes, and the calculation is as follows: 


f(x) & p3(x) =0.5211 + 1.3082(« — 0.5) + 0.7623 (x — 0.5) (x — 1.0) 
+ 0.4829(x — 0.5)(x — 1.0)(x — 2.0). 


Therefore at x = 1.5, f (1.5) = 2.0897. [answer] 


Note that the exact value to four decimal places is sinh(1.5) = 2.1293. With this 
method, the accuracy increases from term to term: 


pı(1.5) = 1.8293, p2(1.5) = 2.2105, and p3(1.5) = 2.0897. 


“eee 
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Newiton’s forward difference formula (equal spacing) 


Newton’s formula in Eq. (9.15) is for arbitrary-spaced nodes. However, if the x;s are 
regularly spaced (as in function tables) such that 


X0, x1 +h, x2 = xo + 2h, ..., Xn = xo +nh, 


then Eq. (9.15) becomes Newton’s forward difference interpolation formula: 


LOTO (") Afo  @Œ=xo+rh,r = (x — x0)/h) 


s=0 


r(r ; 1) Ah ee r(ir— Dat —n+1) 


=fo+rAfot+ A" fo, 


(9.17) 


where the forward differences of f at x; are defined by 


Afi = fin- fi A fj = Afni Afj o MFG = ANT fja A fy. 


(9.18) 
Similarly, the error for this interpolation method is 
n+1 
En(x) = f (x) — prx) = r(r—1)...r —n) fA). (9.19) 
(n+1)! 


Example 9.2.5. Compute sinh 1.56 from the given values using Newton’s forward 
difference formula. 


Solution. 

Using Eq. (9.18) and the given values, the forward differences are computed in Ta- 
ble 9.2. The values needed in Eq. (9.17) are placed in boxes. Also in Eq. (9.17), 
r = (1.5 — 0.0)/0.1 = 1.5. The calculation is as follows: 


1.5(0.5) 


f (1.5) = sinh(1.5) = p3(1.5) = 0.0000 + 1.5(1.1752) + — 


$ 1.5(0.5)(—0.5) 


(1.2765) 


(2.6629) = 2.0751. 


For the error estimate in Eq. (9.19), sinh ¢ = sinhż so that 
1 
63(1.5) = rT x 1.5(0.5)(—0.5)(—1.5) snht = Asinht (A = 0.02344). 
Here 0.0 < ż < 3.0 and one obtains an inequality for the largest and smallest sinh¢ in 


the interval 


Asinh0.0 < €3(1.5) < A sinh 3.0. 


OOOO... 
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Table 9.2 Example of a Newton forward difference table for 
interpolation. 


Jo] xy fj =sinhx; | Af; A? f; A? f; 
0.0 0.0000 
1.1752 
1 1.0 1.1752 1.2765 |° 
2.4517 2.6629 
2 2.0 3.6269 3.9394 
6.3911 
3 3.0 10.018 


a A? f = 2.4517 — 1.1752 = 1.2765. 


Since f(x) = p3(x) + €3(x), the error bands are 


p3(1.5) + Asinh0.0 < sinh 1.5 < p3(1.5) + A sinh 1.5. [answer] 
Ka a a 
2.0751 2.3099 


In fact, the exact answer to four decimal places (sinh 1.5 = 2.1293) lies within these 
bounds. 


9.3 Splines 


For various functions f(x), the corresponding interpolation polynomials in Sec- 
tion 9.2 tend to become more numerically unstable as the degree of the polynomial n 
increases (see Fig. 9.3). Thus, instead of interpolating and approximating by a single 


f. eee Pax) A Pal) 


z7 Ny SK i S i: i \ i 
we x / \ ; / J ù i 
- cA ET 
FIGURE 9.3 


Various-order interpolation polynomials showing instability for higher-order polynomials. 


high-degree polynomial, it is preferable to use a cubic spline g(x) for the function 
f(x) on the interval a < x < b, by subdividing it into subintervals with common 
endpoints (called nodes): 


a = x0 < X1 <... < Xn =b, (9.20) 
where 


80) = fo) = fo, SAD = f) = fis e 8n) =f An) = fr. 9.21) 


an “nT EEE 
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Also if one requires that 
g' (xo) = ko, 9! (Xn) = kn (ko, kn two given numbers), (9.22) 


then the cubic spline can be uniquely determined. For each subinterval x; < x < 
Xj+1 =x; +h, the spline g(x) is given by a cubic polynomial: 


pj(x) =ajo + aji(x — xj) + aja(x — xj) + aj3(x — ay. (9.23) 


The function f(x) is now approximated by n polynomials. The coefficients for the 
polynomial p;(x) in Eq. (9.23) are obtained from a Taylor series formula: 


ajo = pj(xj) = fj, 
aj, = ph (xj) =k, 


1 3 1 
aj = 5 Pi) = (fj41 - fi) ' (kj41 + 2k;), (9.24) 


1 2 1 
aj3 = zri a) = (fi — fis) + 72 (kj+ı +kj)- 


The kı, ..., kn»—1 constants in Eq. (9.24) are obtained from the relation 


3 ; 
kj-1 +4kj + kj =F (fji = fi-v); j=l,...n-1 (9.25) 
(noting that ko and kn are defined by Eq. (9.22)). 


Example 9.3.1. Interpolate f(x) = e% — 1 on the interval —1 <x < 1 by the cubic 
spline g(x) corresponding to the partition x9 = —1, x; =0, x2 = 1 and satisfying 


g(x) = f’(-1) and g(1) = f0). 


Solution. 

In standard notation, fo = f(—1) =e — 1, fi = f(0)=0, fo = fC) =e — 1. The 
given interval is partitioned into n = 2 parts of length h = 1. Hence, the spline g 
consists of two polynomials of the form of Eq. (9.23): 


po(x) = ao +a + 1) +a +1)? +a? (-1 <x <0), 
Pilx) = aio + a11x + a12x° +.413x° (O<x <1). 
As given, f’(—1) = —2e = g/(—1) and f’(1) = 2e = g’(1). Hence, using Eq. (9.22), 
ko = —2e and kz = 2e. Substitution of these values into Eq. (9.25) yields ko + 4k, + 
3 
k = FAC, — fo) = 0. Thus, kı = 0. 
The coefficients of the two polynomials can now be evaluated from Eq. (9.24): 
Coefficients of po: doo = fo=e-—1, 
agi = ko = —2e, 


e ee 
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ao2 = 3( fi — fo) — (ki + 2ko) =3 +e, 
ao3 = 2( fo — fi) + (kı + ko) = —2, 


Coefficients of pi: aio = fi =9, 
aj, =k, =0, 
a2 = 3( fo — fi) — (ko + 2k1) =e — 3, 
a3 =2(fi — fo) + (k2 + kı) = 2. 


The cubic splines after using Maple to simplify the resulting expressions are 
po(x) =x" (e —3 — 2x), 
pi(x) =x? (e —34+ 2x). 

The spline g(x) approximating f(x) (see Fig. 9.4) is 


x*(e—3-—2x) if (-l<x <0), 


g(x) = ee —342x) if O<x<1). [answer] 


-0.5 


FIGURE 9.4 


Example of a cubic spline fit of a function. 


9.4 Data smoothing 

A practical technique for the smoothing of measured data in engineering applica- 
tions is the well-known Savitzky—Golay (S—G) filter (Savitzky and Golay, 1964). This 
technique employs a simple arithmetic calculation for the smoothing of data over an 
interval with a chosen number of points. For example, a popular choice is a quadratic 


ao C(‘(‘(‘“‘(‘(‘a‘é‘“a“‘#és#‘C“R_R®DOo 
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fit (with a fitting polynomial of order n = 2) to a seven-point data interval (where the 
index i ranges from —3 to +3) for the points y; in the interval. Coefficients for other- 
order polynomials and data intervals are tabulated in Savitzky and Golay (1964). For 
the use of this technique, the data must be equally spaced. The measured data are 
also reduced in size with a loss of the first three and last three data points in this 
seven-point calculation. As mentioned, the fitting coefficients are evaluated with a 
simple arithmetic operation providing both a smoothed value of the quantity of inter- 
est as well as its derivative as required. The coefficients for the smoothed value and 
derivative are given, respectively, as 


1 
b2 = TI [—2y_3 + 3y_2 + 6y_1 + 7yo + 6y1 + 3y2 — 2y3], 


1 
ba, = 78 [—3y_3 — 2y_2 — ly_1 + Oyo + lyi + 2y2 + 3y3]. 


Using a moving calculation with a standard Excel spreadsheet analysis, one sim- 
ply shifts the center of the interval from the point i to the point i + 1 and recalculates 
for the next point. Thus, the formula is applied for the ith data point after which the 
corner tab of the cell is pulled down to extend the calculation over the remaining 
data entries. Thus, the smoothed value (and derivative as needed) can be found at the 
subsequent points. 


Example 9.4.1. Consider a constant rate of release R of material into a closed 
system, where there is also a loss of material characterized by a first-order rate con- 


dN 
stant k. The mass balance of material N(t) at time t follows as — = R — kN. 


t 
If there is no material in the system at time zero, N(0) = 0. Thus, the solution 
of this ordinary differential equation using an integrating factor (Example 2.1.8) is 


R 

N(t)= T (1-— Co, or equivalently A(t) = kN (t) = R(1 — e~*). At equilibrium 
as t > œ, A —> R. Fig. 9.5 shows a history plot of A(t) with an associated random 
error. For this plot, the rate constant is k = 0.0862 d`! and the normalized release 
rate is R = 1. This problem is analogous to Problem 7.7 for the coolant activity of 


fission products in the primary heat transport system of a nuclear reactor. 


Given the case of noisy data in Fig. 9.5, evaluate (a) the smoothed quantity A(t) and 
(b) the smoothed release rate R as a function of time using an S-G filter. 


Solution. 


(a) The smoothed value A(t) is simply given by applying b2ọ to the individual data 
points in the figure with an Excel analysis. The equation is first centered on the fourth 
point (using three points before this point and three points afterwards for the simple 
arithmetic operation). One then moves to the next point along the curve repeating the 
calculation with the original data. The smoothed curve for A(t) is shown in Fig. 9.5. 


o 


278 CHAPTER 9 Numerical analysis 


0.8 
0.6 œ 
0.4 
+ 
0.2 0.2 
+ 
0 + 0 
0 5 10 15 20 25 30 35 
Time (d) 
+ Measurement A(SG) ----- R(SG) R (analytic) 


FIGURE 9.5 


Mass balance analysis for noisy data using an S-G filter. Note that smoothed values are not 
available for the first three and last three data points for the seven-point fitting. 


dA 
(b) The mass balance equation can be rearranged to solve for R as R = kde +Az= 
1b 
= x + b20, where At = 1 d in accordance with the equal spacing of the data. The 
smoothed curve for R(t) is shown in Fig. 9.5 along with the constant analytic value 


of R equal to unity. 


9.5 Numerical integration and differentiation 


If a definite integral cannot be obtained by usual methods of calculus or the integrand 
is an empirical function given by measured values, numerical integration can be used. 


9.5.1 Trapezoidal rule 

In this method, the interval of integration a < x < b is subdivided into n subintervals 
of equal length h = (b — a)/n and the function f(x) is approximated by a chord 
with endpoints [a, f(a)], [x, f(x)], ..., [b, f (b)] on the curve of f (see Fig. 9.6). 
The area under the curve of f is approximated by n trapezoids of areas JI f(a)+ 
fh, Hf) + f&œ)]h, ..., MS Gn-1) + f(b)]h. Hence, the trapezoidal rule 
is obtained as the following sum: 


b 1 1 
J =f f (x) dx ETAO + fea) + fa) t -t fan) zf 


(9.26) 


ee = = 
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FIGURE 9.6 


Calculation of the area under a curve using the trapezoidal rule. 


The error in Eq. (9.26) can be derived as follows. Using Eq. (9.14) for the error in 
the Lagrange interpolation polynomial with n = 1 for a single subinterval, 


TMO 


fœ) = Pi@) = & — x0) — d 


Integrating over x for the first trapezoid (that is, from a = xo to x} = xo +h) gives 


x+h x+h ï 
| d-na f= a0) = 9 — E ae. 


0 x0 2 


Setting € = x — xo and applying the mean value theorem of integral calculus yields 


x+h NE h 3 3 es 
FÖ (É PFO 
[ Fa- pi) dx= Í se -mae= (5 >) 


where f is a suitable value between xp and xı. Hence, the error in € in Eq. (9.26) for 
n intervals is 


b-a) 
a 


12 


(b — a)? 
12n2 


nh? z | z zs 
e=- f"O=- f"O=- f"@, 
12 
with f being a suitable value between a and b. The error bands are now obtained by 
taking the largest and smallest values of f” in the interval of integration (that is, M2 


and M>); hence 


(b-a)? 


KM <€< KMž, where K =— 
12n? 


(9.27) 


e OOOO. 
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1 
Example 9.5.1. Evaluate f sinh(x) dx by the trapezoid rule and estimate the error 
0 
with n = 10. 


Solution. 
Using Eq. (9.26) one has 


inh(O 
Jx oaf 2 ) + sinh(0.1) + sinb(0.2) + sinh(0.3) + sinh (0.4) + sinh(0.5) 
; . : ; sinh(1) 
+ sinh(0.6) + sinh(0.7) + sinh(0.8) + sinh(0.9) + = 0.543533. 


The error can be calculated from Eq. (9.27). By differentiation, f” (x) = sinh x. Also, 
f(x) > 0 if 0 <x < 1 so that the minimum and maximum occur at the ends of the 
interval. Hence, M3 = f”(1) = 1.175201 and M3 = f”(0) = 0, and K! = —1200. 
Therefore, Eq. (9.27) gives —0.000979 < e < 0.000000 so that the exact value must 
lie between 


0.542554 = 0.543533 — 0.000979 < J < 0.543533 + 0.000000 = 0.543533. 


[answer] 


Actually, J = 0.543081, which is exact to six decimal places, so that J lies within 
these bounds. 


9.5.2 Simpson’s rule 


A linear piecewise approximation of f gives a trapezoidal rule of integration, whereas 
a higher accuracy can be obtained with a piecewise quadratic approximation. 


Here the interval of integration a < x < b is divided into an even number of 
subintervals (that is, n = 2m) of length h = (b — a)/2m with endpoints x9 = 
4, X1, -.-;X2m—1,X2m = b (see Fig. 9.7). Considering the first two subintervals x9 < 
x <x2 = xo + 2h, f(x) can be approximated by the Lagrange interpolation poly- 
nomial p2(x) through the nodes (xo, fo), (v1, f1), (x2, f2), where fj = f (xj), using 
Eq. (9.11), 


(x — x1)(x — x2) (x — xo) (x — x2) (x — xXo0)(x — x1) 
(xo — x1) (x0 — x2) ` j (xı — x0) (x1 — x2) ! (x2 — xo) (x2 — x1) ` 3 


p2(x) = 


The denominators are 2h”, —h, and 2h, respectively. Setting s = (x — x1)/h im- 
plies (x — x9) = (s + Dh, (x — x1) = sh, and (x — x2) = (s — Ih. Hence, 


1 1 
pals) = 3868 = Df = (s + Dis = Dfi + is + sf. 


Integrating from xo to x2 yields 


ee Ul ree 


9.5 Numerical integration and differentiation 281 


a 


y 
Second parabola 
First parabola / 
Ym a =f(x 
y N Pá fix) Last parabola 
ý 4 i / 
V 
| > 
@ Xz, Xz Xz Ay vee Xaaa Dx 


FIGURE 9.7 


Calculation of the area under a curve using the Simpson rule. 


x 1 
f fæaxan f mojas=r|£+$n+] 
xo —1 


3 3 
A similar formula holds for the next two subintervals from x2 to x4, and so on. There- 
fore, summing these m formulas gives the Simpson rule: 


4 h 
jel fœ)dx x 3 [fo +4fi +2f2 +43 + ...+2 fom—2+4fom—1+ frm). 


(9.28) 
The error bounds for Eq. (9.28) are obtained by a similar method to that of the 
trapezoidal rule (assuming the fourth derivative of f exists and is continuous): 


; b-a 
CM4 <e< CM, 7 where C = ~ 180m)’ (9.29) 


where M4 and Mj are the largest and smallest values of the fourth derivative in the 
interval of integration. 


1 
Example 9.5.2. Evaluate f sinh(x) dx by the Simpson rule and estimate the error 
0 
with n = 10. 
Solution. 


Using Eq. (9.28) one has 


3 
+ 4(sinh(0.5)) + 2(sinh(0.6)) + 4(sinh(0.7)) + 2(sinh(0.8)) + 4(sinh(0.9))) 


je% [simo + 4(sinh(0.1)) + 2(sinh(0.2)) + 4(sinh(0.3)) + 2(sinh(0.4)) 


+ inn | = 0.54308094. 


a OOOO. 
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The error can be calculated from Eq. (9.29). By differentiation, f IV (x) = sinhx. 
By considering the derivative of fV (that is, fY), the smallest value of f V in 
the interval of integration occurs at x = 0 and the largest value at x = 1. Therefore, 
M4 = f!V (1) = 1.1752012 and Mł = f!Y (0) =0. Since 2m = 10 and b — a = 
1, one obtains C = —1/(180 x 10*) = —0.00000056. Therefore, from Eq. (9.29), 
—0.00000065 < e < 0.0000000 so that the exact value must lie between: 


0.54308028 = 0.54308094 — 0.00000065 < J < 0.54308094 + 0.000000 
= 0.54308094. [answer] 


In fact, J = 0.54308064, which is exact to eight decimal places, so that J lies within 
these bounds. Note that for the same number of nodes, the present result is much 
better than that obtained in Example 9.5.1 with the trapezoidal rule. 


9.5.3 Gaussian integration formula 


This method provides a high degree of accuracy; however, it requires an irregular 
spacing of x1, ..., Xn, where 


1 n 
J fdr Ae. =f (9.30) 
S = 


Here the values x1,...,x, are the n zeros of the Legendre polynomial P,, (x) (see 
Eq. (2.144)), where 


1 1 
Po=1, Pi(x) =x, Po(x) = 5 Bx? — 1), P3(x) = 5 (Gx? — 3x), _ 


The numerical value of the zeros of P,(x), and the corresponding coefficients 
Aj,..., An are given in Table 9.3. 


Table 9.3 Values of zeros and coefficients for 
the Gaussian integration formula. 


n Zeros of P,,(x) Coefficients, A; 
2 +1//3 1 
3 O 8/9 
+,/3/5 5/9 
4 +,/ (15 — ¥120)/35 0.6521451549 
+,/ (15 + v 120)/35 0.3478548451 


ee lL rere 
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The method can be applied to an integral with any constant limits by applying an 
1 


appropriate transformation so that f(x)dx = Í g(x)dx. This transformation 


is derived by letting x = mt + c, where x =a whent = —1, and x = b when t = +1. 
These requirements yield the two following equations: 


a=m(—l)+c, 
b=m(1)+c, 


b+2 Í ; 
and c = ——. Hence, the required transformation be- 


b—a b+a b—a . 
comes x = t+ TE and dx = z dt. Finally, 


2 
b dx= b—a l b-a, b+a dt 931 
[roe hea )e) e 


1 
Example 9.5.3. Evaluate J sinhx dx by the Gaussian integration formula with 
0 
n=3. 


i . b-a 
with the solution m = 


Solution. 
Using Eq. (9.31) and Table 9.3 one has 


1 
z=] sinh x dx 
0 


T inh eea 
= sinh — 
2 J 2 


= [Ss] (E) s] fE (E) 


= 0.54308037. [answer] 


This result is as accurate as the result in Example 9.5.2 but requires fewer operations. 


9.5.4 Numerical differentiation 


Numerical differentiation involves the computation of a derivative of a function f 
from given values of f. Such formulas are basic to the numerical solution of differ- 
ential equations. 


fath- fa) 


Defining f; = f'(x;), Fy = f" (xj), where f'(x) = jim, 7 


tains the relations 


, one ob- 


hie fiz fo 


fip® ie * 


M U 
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and 


Mon, 8’ fi = f2 -2fit fo 
f 1 hz h2 ` 
More accurate approximations can be obtained by differentiating suitable Lagrange 
interpolation polynomials. For example, given the Lagrange interpolation polynomial 
as before, 


(x — x1) (x — x2) (x — xo) (x — x2) (x — xo) (x — x1) 
2h? Jo h2 iss 2h2 


p2(x) = fr, 


where x1 — x9 = x2 — x1 = h, one obtains 


X—X1—X2 2x — x0 — X2 2x — x0 — X1 
22 he ; 2h? 


2 
f(x) S ph) = fr. 


Evaluating this expression at the points xo, x1, x2 yields the “three-point” formula: 


1 

h~ z 3 fo + 4A — fr), 
1 

fi~ zz CPt fa), 
1 

h~ z PTA +3 h). 


Similarly, using the Lagrange interpolation polynomial p4(x) yields 


1 


tn pp DSA fa). 


Problems 

9.1 Solve x = cosx by: 
(a) fixed point iteration (xp = 1, 20 steps, six significant figures); 
(b) Newton’s method (xp = 1, six decimal places) (sketch the function first); 
(c) the secant method (xp = 0.5, xı = 1, six decimal places). 


9.2 As shown in the moving-boundary Problem 5.7 with a liquid front during the 
melting of a material, the following transcendental equation arises for the variable y: 
o (Uy — Um) 
Lyr 
of the other constants and material property values were known), define the function 
f(y) to be used and give the iteration scheme for each of the following methods: (i) 
fixed point iteration, (ii) Newton’s method, and (iii) the secant method. For applica- 
tion of the Newton method, what is the expression for f'(x)? 


yer erf(y) = . To numerically solve this transcendental equation (if all 


ao OO O 
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9.3 Consider the following tabular data: 1n9.0 = 2.1972, ln9.5 = 2.2513, and 
In 11.0 = 2.3979. 

(a) Calculate the Lagrange interpolation polynomial p2(x) and compute approxi- 
mations of ln 9.4, In 10, In 10.5, In 11.5, In 12. Evaluate the errors by using exact val- 
ues to four decimal places. 

(b) Comment on the effect of extrapolation. 

(c) Setup Newton’s divided difference formula for the same data, and derive from 
it p2(x). 


9.4 Find the cubic spline to the given data with kg = —2 and kz = — 14. Data: fọ = 
f(—2) = 1, fi = fO) = 5, fo = f(2) = 17. What is the advantage of a spline fit 
compared to a Lagrange polynomial for interpolation? 


9.5 Evaluate the definite integral J = f using: 


0 1 + X 2 
(a) an exact analytic formula obtained from calculus; 
(b) approximate methods: (i) the trapezoidal rule (n = 4); (ii) Simpson’s rule 
(2m = 4); and (iii) the Gaussian integration formula (n = 4). 
(c) Compute the error bounds for the trapezoidal rule in part (b). 
(d) Compare the results obtained in parts (a) and (b). 


9.6 The probability P that a measurement will fall within ¢ standard deviations is 


—(x — w)? 
who P| — z 
dx. 


iven by P = 
E y J, ON Qn 
(a) Show how this probability expression reduces to the normal error integral P = 
2 
1 | A 
——— e 2 dz. 
V20 L 


(b) Using the normal error integral and the Gaussian integration formula (for n = 
2), calculate the probability that a measurement will fall within one standard deviation 
(that is, t = 1). What is the relative error for this estimation? 

oad * sinu O? 
9.7 The sine integral is defined as Si (x) = — du. Evaluate the quantity Si(1) 
0 u 
(that is, for x = 1) using a Gaussian integration formula to four significant digits using 
the zeros of the Legendre polynomial P(x). Using the actual value from a mathemat- 
ical handbook (for example, Spiegel, 1973), estimate the relative and absolute error 
for the estimate. 


9.8 For application of the Gauss integration method, show that using Eq. (9.31) the 
following integral can be transformed as 


f -2 q T le+ ila 
e LS ex = è 
0 aJa Va 


CHAPTER 


Introduction to complex 
analysis 


Complex analysis is important in many fields of engineering. For instance, appli- 
cations in electrical engineering include control theory (transfer functions), signal 
analysis in communications, Fourier transform analysis, circuit and electronic theory, 
electromagnetism (time-harmonic fields), and electrostatics (solution of the Laplace 
equation for the complex potential). In addition to its application in electrostatics, the 
complex potential is a powerful methodology that can be further employed to solve 
analogous problems in fluid and heat flow in chemical engineering, as well as air- 
foil design in the aerospace industry. Complex analysis can be further used to solve 
boundary value problems for solution of differential equations, as well as for evalu- 
ation of inverse transforms and definite integrals. Complex functions arise in power 
transmission, and complex number field theory is also an important tool in quantum 
mechanics. 

This chapter introduces the properties of complex functions and integration meth- 
ods leading to the development of the residue theorem. For example, the residue 
theorem is used to evaluate definite integrals in Chapter 3, as well as to determine 
an inverse Laplace transform for solution of a heat conduction problem in Chapter 5. 
Finally contour mapping is applied in this chapter for the solution of Dirichlet bound- 
ary value problems and for various engineering problems in fluid flow, electrostatics, 
and heat flow. 


10.1 Complex functions 


A function w of a complex variable z can be written as w = f(z). The function is 
single-valued if for each value of z there corresponds only one value of w; otherwise 
it is multivalued. 


In general, w = f(z) = u(x, y) + iv(x, y), where u and v are real functions of x and 
y. For example, w = z? = (x + iy)? = x? — y? + 2ixy =u + iv. As such u(x, y) = 


x? — y? and v(x, y) = 2xy. 


Limits and continuity 


The function f(z) is said to have a limit as z approaches zo if, given any € > 0, 
there exists a ô > 0 such that | f(z) — L| < € whenever 0 < |z — zo| < ô. Alternatively, 
f(z) is continuous if lim f(z) = f (zo). 

z—> 20 


Advanced Mathematics for Engineering Students. https://doi.org/10.1016/B978-0-12-823681-9.00018-6 2 8 7 
Copyright © 2022 Elsevier Inc. All rights reserved. 


a U‘ 
288 CHAPTER 10 Introduction to complex analysis 


Derivatives 


If f(z) is single-valued in some region of the z plane, the derivative of f (z) is defined 
as 


_——— (10.1) 


Az 


eo i 


provided the limit exists independent of the manner in which Az — 0. If the limit in 
Eq. (10.1) exists for z = zo, then f(z) is called analytic at zo. If the limit exists for all 
z ina region , then f(z) is called analytic in R. In order to be analytic, f(z) must 
be single-valued and continuous (however, the converse is not necessarily true). 

This definition for the derivative is analogous to that of a real function. However, 
in contrast, complex derivatives and differential functions satisfy a much stronger 
condition. Complex functions that are differentiable at every point on an open subset 
of the complex plane are said to be holomorphic. Here, the value of the difference 
quotient in the limit of Eq. (10.1) must approach the same complex number, regard- 
less of the way that zo is approached. Hence, holomorphic functions are infinitely 
differentiable. They are also analytic at every point in the domain as given by a con- 
vergent power series. 

Elementary functions of a complex variable are natural extensions of the cor- 
responding real functions, that is, where series expansions for real functions f(x) 
exists, the complex series is obtained by replacing x with z. For example, 


a, z2 z 
e = FET aye ap Po 
B 3 zò z/ 
ann cee al 
z? zt z6 
cosz=1——+—— — 
2 4! 6! 


Also from these series it follows that e? = e*t'” = e* (cos y + i sin y). On letting 
z = i0, one obtains the so-called Euler formula: e!? = cos 6 + i sin@. The polar form 
of a complex number is pe’. Moreover, if p is any real number, de Moivre’s theorem 
states [p (cos + i sin9)]? = p? (cos p9 +i sin pd). 

The number w is called the nth root of the complex number z if w” = z (or 
equivalently w = z!/"). Moreover, if n is a positive integer, de Moivre’s theorem 
yields 


z!/" = [p (cos +isind)|!/” 


6 +2k 6 + 2k 
= piln eo (==) isin (a) ~ £0] ik 
n n 


Thus, there are n different values for z!/” provided that z 4 0. 


Example 10.1.1. Show that Inz is a multivalued function. 


se Eee 


10.2 Complex integration 289 


Solution. 

Define a? as e , where a and b are complex numbers. Since e = 1, we have 
el? = el (+k) where k is an integer. One therefore defines In z = In(pe!*) = Inp + 
i(@+2kz). Hence, In z is a multivalued function (that is, this multi-valued function is 
composed of various single-valued functions that are called its branches). [answer] 


bina 20ki 


Rules for differentiating functions of a complex variable are the same as those for 


d 
real variables. For example, T (z”) =nz"—! and = z) = cos z. 
Z Z 


Cauchy-Riemann equations 
A necessary condition for w = f(z) = u(x, y) + iv(x, y) to be analytic in a region N 
is that u and v satisfy Cauchy—Riemann equations: 


E ene (10.2) 


If the partial derivatives in Eq. (10.2) are continuous in i, the equations are sufficient 
conditions that f(z) is analytic in 9. If the second derivatives of u and v with respect 
to x and y exist and are continuous, one finds that differentiating Eq. (10.2), 


3u P du ai dv M dv m (103) 
əx? əy? x2 ðy ` ` 


Thus, the real and imaginary parts satisfy Laplace’s equation in two dimensions. Such 
functions are called harmonic functions. 


10.2 Complex integration 


If f(x) is defined, single-valued, and continuous in K, the integral of f(z) can be 
defined along some path C in A from point zı (= x; + iy) to point z2(= x2 +iy2) as 


(x2, 2) 
| fod= f i (u+iv)(dx +idy) 
C ( 


x1,y1) 


(x2,y2) (x2,y2) 
=i (udx —vdy) +i f (vdx + udy). 
( 


1,91) (x1,y1) 


With this definition, the integral of a function of a complex variable can be made 
to depend on line integrals of real functions. The rules for complex integration are 
similar to those for real integrals. For instance, 


f f(z)dz 
c 


<f f©lldz<M | ds= ML, (10.4) 
C C 


OOOO 
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where M is an upper bound of | f(z)| on C, that is, | f(z)| < M, and L is the length 
of the path C. 


Cauchy’s theorem 


Let C be a simple closed curve. If f(z) is analytic within the region bounded by C 
as well as on C, then Cauchy’s theorem states 


/ fde= f(zdz=0, (10.5) 
c c 


where the second integral emphasizes the fact that C is a simple closed curve. 
22 


Eq. (10.5) is equivalent to the statement that J (z) dz has a value independent of 


Zi 
the path joining zı and z2. Such integrals can be evaluated as F (z2) — F (z1), where 
F'(z) = f(z). For example: 


(i) Since f(z) = 2z is analytic everywhere, for any simple closed curve C : 


$2: dz=0. 
c 
1+i 


=( +D- Q2? =2i +4. 


1+i 
(ii) We have f 2zdz =z 
2i 


2i 


Example 10.2.1. Derive Cauchy’s theorem from first principles. 


Solution. 

Eq. (10.5) is a line integral, where f(z) = u + iv is analytic over a region bounded 
by the closed C and dz is an infinitesimal part of the path around C such that dz = 
dx + idy. Hence, Eq. (10.5) can be written as 


§ fode= plu ivy(dx tidy) = $ (udx—vdy) +i d (vdx +udy). (10.6) 
C C C 


Consider the Green’s theorem from Eq. (4.13) in Section 4.4.2: 


gp dx+ody= ff ($e - =) axay, 
Ë A \ Ox dy 


where P(x) and Q(x) are well-behaved functions and A is the area bounded by the 
contour C. One can apply this theorem to each of the terms in the last expression of 
Eq. (10.6). Therefore, for the first term, letting P = u and Q = —v yields 


f wax —vdy)= ff ees dxdy=0. 
c A\ 0x dy 


“Eee 
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This expression equals zero because of the second Cauchy—Riemann condition in 
Eq. (10.2). Similarly, letting P = v and Q = u in the second term yields 


ðu ðv 
f wax+uav= ff X E) ardy=0. 
c a\ðx əy 


This expression also equals zero because of the first Cauchy—Riemann condition in 
Eq. (10.2). Thus, Eq. (10.5) is shown to equal zero. [answer] 


Cauchy’s integral formulas 


If f(z) is analytic within and on a simple closed curve C and a is any point interior 
to C, then 


1 
ae fC) 


2xi Jcz—a 


dz, (10.7) 


where C is traversed in the positive counterclockwise sense. Also the nth derivative 
of f(z) at z =a is given by 


MORET) f@) ay (10.8) 


ri c z- a)ntl à 


Eqs. (10.7) and (10.8) are called Cauchy’s integral formulas. These are quite remark- 
able because they show that if the function f(z) is known on the closed curve C, 
then it is also known within C, and the various derivatives and points within C can 
be calculated. Thus, if a function of a complex variable has a first derivative, it has all 
higher derivatives as well. This result of course is not necessarily true for functions 
of real variables. 


COS Z ; ; 
dz, where C is the circle |z — 1| = 3. 
x 


Example 10.2.2. Evaluate $ 
cz> 
Solution. 
; : me 1 COS Z 
Since z = z lies within C, —— 
2mi Jo z—-1 
cos z and z= 7. Therefore, 


dz = cos m = —1 by Eq. (10.7) with f(z) = 


COS Z 
$ dz = —2ri. [answer] 


Example 10.2.3. Derive the Cauchy integral formula in Eq. (10.7) from first princi- 


ples. 

Solution. 

Consider the function ¢ (z) = fo , which is analytic inside of C except at the 
z— 


point a, which lies within C, where there is a singularity. Furthermore, let C’ be a 
small circle of radius p that is centered at a and make a cut between C and C’ along 
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AB, as shown in Fig. 10.1. Furthermore, consider the path in Fig. 10.1 from A around 
C to A’, along A’ B’, around C’ from B’ to B, and back along BA. 


FIGURE 10.1 


Schematic of a closed path for evaluation of Cauchy’s integral formula. 


From Cauchy’s theorem in Eq. (10.5), 


§ o(z)dz=0 
AA'B'BA 


for the closed path. Since ¢ (z) is analytic in the region between C and C’, we have 


/ ods f ode f boaz f ġ(z)dz =0. 
AA! AB! B'B BA 


If now both A and A’, and B and B’ each come together and coincide, then the 
integral along the straight line A’ B’ is equal but opposite to the integral along BA so 
that these two integrals cancel, leaving the following two integrals: 


f bode f o(z)dz=0. 
AA! B'B 


Recognizing the direction of the path integrals, 


f ooa +¢ o(z)dz=0 
c c 


— au < 
counterclockwise clockwise 


or 
$ ide $ Jast, (10.9) 
C (eu 


where the integrals in Eq. (10.9) are understood to be counterclockwise. For the circle 
C’ of radius p centered at a in Fig. 10.1, one can use polar coordinates so that z = 


a a The 


10.3 Taylor and Laurent series 293 


a+ pe’? and dz =ipe!® d9, and therefore 


27 27 
/ ooac= PO 9, = Prive a0 =i | f(@) dé. 
Cc’ C pe 0 


,ı(z—a) 0 


Since this result is valid for any p, one can let p approach zero for which 
z approaches a. As f(z) is both continuous and analytic at z = a, in the limit 
lim f(z) = f(a). Thus, the integral around C’ is evaluated as 
Za 


27 
$ ode=i so | d0 =2ni f(a). 
Cc’ 0 


Hence, substituting this result into Eq. (10.9) yields the final result in Eq. (10.7): 


f@= af fQ) dz. [answer] 
C 


2Q0i (z — a) 


10.3 Taylor and Laurent series 


Let f(z) be analytic inside and on a circle having its center at z = a. Then for all 
points z in the circle, the Taylor series representation of f(z) is 


1 


a (z—a)>+.... (10.10) 


f) = fa) + faz pet (z-a? + 


a) 
! 


f” (a) 
3! 


Example 10.3.1. If f(z) is analytic at all points inside and on a circle of radius R 
with center at a and if a + h is any point inside C, prove Taylor’s theorem that 


h? h? 
f(ath)= f(a)+hf'(a)+ ai @ + T Huen 


Solution. 
By Cauchy’s integral formula Eq. (10.7), 
1 
f(a+h)=— ADs (10.11) 
2mi Jo z—a-—h 
However, 
1 _ 1 
-a-h h 
z—a—h E [i E | 
z—a 
1 h h2 h” Arti 
= 1+ + +... + + À 
| (z-a) (z-a)? (z — a)" ECET] 


(10.12) 


a a 
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Substituting Eq. (10.12) into Eq. (10.11) and using the Cauchy integral formulas in 
Eq. (10.7) and Eq. (10.8) yields 


1 MGA h $ f@)dz i h $ f(z)dz 


2mxi Je z—a  2rijJc(z—-a)} `  2xi Je (z—a)"*! 


flat+h)= +R, 


2 n 
= f(a) +hf'(a)+ Tw +... + E foga) + Rn, 


h”+! d 
where R, = — $ f (z)dz 
Ç 


2ri (z —a)"+tl (z —a-— h) 
N : f f 
ow when z is on C, ENT < M and |z — a| = R so that by Eq. (10.4) since 
z—a— 
2x R is the length of C, 
|h\"*!M 
(Ral £ Reel OR 


Asn —> œ, |R,| — 0 and the required result follows. [answer] 


Singular points 
A singular point of a function f(z) is a value of z at which f(z) fails to be analytic. 
If f(z) is analytic everywhere in some region except at an interior point z = a, the 


point z = a is called an isolated singularity of f(z). For example, if f(z) = ———; 


z- 3)? 
the point z = 3 is an isolated singularity of f(z). 
Poles 
If f(z) = ( a , $ (a) £0, where ¢ (z) is analytic everywhere in a region includ- 
z—a 


ing z =a, and if n is a positive integer, then f(z) has an isolated singularity at z = a 
which is called a pole of order n. If n = 1, the pole is called a simple pole. If n = 2, 
the pole is a double pole. 


Zz 
For example, f(z) = ————~———— has two singularities: a double pole at z = 3 
ple, f(z) @-3°G 4) g p 
and a simple pole at z = —1. 
3z- 1 3z- 1 


As another example, f(z) = 
+27. 


= - — has two simple poles at z = 
244 +2 —2i) es 


A function can have other types of singularities besides poles. For example, f(z) = 
./z has a branch point at z = 0 (which follows since f(z) is a double-valued func- 


F ; sin z f : 
tion). Also the function —— has a singularity at z = 0. However, due to the fact that 
Z 


so BZ oe eoad ; ; 
uir — is finite, this is called a removable singularity. 
a? Z 


a a The 
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If f(z) has a pole of order n at z = a but is analytic at every other point and on a 
circle C with center at a, the function (z — a)” f (z) is analytic at all points inside and 
on C and has a Taylor series about z = a so that 


_ a-n a—n+1 a-1 
i= oan” Gawd Gan 
or (10.13) 

a_| a_2 
a) Gaar +... 


+ao+aı(z— a) +... 


f(z) =an +a z-a) + a(z- a)? +... + 


This series is called the Laurent series for f (z). Here ap + a1 (z — a) +.a2(z— a} +... 
is called the analytic part, while the remainder (consisting of the inverse powers of 
(z — a)) is the principal part. A Laurent series is more generally defined as 


[0,6] 


X ag(z—a)*, (10.14) 


k=—0o 


where the terms with k < 0 constitute the principal part. When the principal part of the 
Laurent series has a finite number of terms and a_, 4 0 while a—n—1, d—n—2, ... are all 
zero, then z = a is apole of order n. If the principal part has infinitely many terms, z = 


a is called an essential singularity. For example, the function e! = 14+ —+ 2 +... 
Z Iz 


has an essential singularity at z = 0. 


FIGURE 10.2 


Schematic of contours for the Laurent series. 


Consider Cı and C as two concentric circles centered at z = a, as shown in 
Fig. 10.2. If f(z) is analytic in the region i between the circles, then f(z) can be ex- 
panded as a Laurent series that is convergent in Ñ. In general, for a Laurent series, the 
power series converges inside a circle C2, while the inverse power series converges 
outside the circle Cı. Hence, the radius of C2 is the radius of convergence for the am 


e 
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series that converges for any value of |z| < p2, where pz is the radius of C2. The a,,_ 
series will converge for any value of |z| > 01, where p; is the radius of C1. 


Coefficients of a Laurent series 
The coefficients ap, a1, a2...4—1, 4_2... for the Laurent series expansion of f(x) can 
be evaluated as follows. 


aa — Se 
7 i 
/ P Q \ 
a) J 
\ ae Z 
i sul 


FIGURE 10.3 


Schematic of a circle Cı bounded within a region C. 


Let Cı be a circle of radius p centered at a (see Fig. 10.3). Since (z — a)” is 
analytic within and on the boundary of the region bounded by C and C], 


i =f dz =f dz 
"Je @-a)" Jo e-a" 


Using polar coordinates on C], |z — a| = p or z — a = pe'® and dz = ip e!? d0. The 
integral therefore equals 


i piel? d8 _ i (i o(-n)i8 rr 
o (pe) — ph Jo l 


Consider two cases for the integral: 


(i) (9 =1): =i fo a0 = 2a; 


(ii) 41): 
I -Í 1 [cme ao)" = 1 1 e] 
"pid= o pT =n) 
EE [d — n)27] +i sin[(1 — n)2x] —1 
= cos[(1 — n)2x sin[(1 — n)27] — 
pri (l —n) oe RERA al 
=+1 =0 


=0 for any integer n positive or negative except n = 1. 


So CCCCCCCC“‘( “™UUUUUCtsi(‘(“#l#éa“w»™_*N“XN|YUua’7’?'!'7/_/_ 
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Then the integral J,, yields 


dz 2ri, forn=1, 
= - ; (10.15) 
c (z—a)" 0, for n Æ 1, positive or negative. 


Now, consider the integral 


1 f 
J= d 10.16 
Ii $ -ayti to) 
with the Laurent series f(z) = aọ + a(z — a) + az(z — a) + ... + a J 
zZ-a 
ae + .... As mentioned, this series is convergent in a region Ñ within two con- 
z—a 


centric circles Cı and C2 centered on the point a (see Fig. 10.2), where C is any 
simple closed curve surrounding a and lying in ft. 


Thus, inserting f(z) into J gives 


= i > dm(Z — a)” ~ a-m 1 
= Qi C bs (z—a)"+! PE (z — a)” =| dz (10.17) 


m m 
1 am a-m 
= dz. 10.18 
Ani $ £ (z — arm + 2 (z = | K ( ) 


From Eq. (10.15), one only gets a nonzero result in the first term when n —m-+1= 1 
(that is, m = n), which evaluates as a, for the integral in Eq. (10.16). Hence, the 
integral J gives the coefficient ay: 


Similarly, with n + m + 1 = 1 (that is, m = —n), the coefficient a_, is given by 


ee eee 


2ri Jo (z-a) "t! 


In summary: 


e Ifall a_m coefficients are zero, then the Laurent expansion reduces to a Taylor 
series expansion, f(z) is analytic at z = a, and a is called a regular point. 

e If a_», =0 for all m >n, then f(z) is said to have a pole of order n at z = a. If 
n= 1, then f(z) is said to have a simple pole at z =a. 

e If there are an infinite number of nonzero a_,, coefficients, then f(z) is said to 
have an essential singularity at z = a. 


M E O” 
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e The coefficient a_; has a special meaning and is recognized as the residue of f(z) 
at z = a (see Section 10.4). 


Example 10.3.2. Find the Laurent series for the function = E about the singu- 
larity z = 1. 
Solution. 
Let z — 1 = u so that z = 1 + u. Then 
e eltu e" e u uw 
= =e. — 1 —+—+... 
Gat? a2 R s| Tee gT | 
e e e elz—1) e(z—1) 
= Gop + TEN + Ji 3i + 7i +. [answer] 


Thus, z = 1 is a pole of order 2 (that is, a double pole). The series converges for all 
values of z Æ 1. 

aE 
pecs + —— 


Example 10.3.3. Given f(z) =e =1+z+ 1 3] 


+..., what is the value of a? 
Solution. 
On inspection of the Laurent series in Eq. (10.13),a=0. [answer] 


Note that there is no series with terms containing a_. Thus, the function f(a) is 
analytic at z =a. In this example, f(a) = 1. Lastly, there is no a_; term, and hence 
the residue at z = a = 0 is zero. 


ic 1 1 1 1 
Example 10.3.4. Given f(z) = - =t aF z +..., what is the value 
z? z z 27 3! 4 
ofa? 
Solution. 


Again the value is identified as a = 0. [answer] 


However, in this example, there is a series containing the a_,, terms up to a term in 
1/23, thus a is a pole of order three. The residue is the coefficient a_, = `: 


10.4 Residue integration method 

The coefficients in Eq. (10.13) can be obtained by writing the coefficient of the Taylor 
series corresponding to (z — a)” f (z). The coefficient a—1, called the residue of f(z) 
at the pole z = a, is of considerable importance. It can be evaluated from 


: 1 q”! a 
eam i aa T OT); (10.19) 


ee Ee 
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where n is the order of the pole. For a simple pole 


a_, = lim (z — a) f (2). (10.20) 
z> a 
z2 
Example 10.4.1. Find the residues of the function f(z) = ———.~——~ at the 


(z — 2)(z2 + 1) 
poles z = 2, i, and — i. 


Solution. 
These are simple poles and Eq. (10.20) applies so that: 


Residue at z = 2: lim (z 42) a 
esidue at z = 2: Hm (Zz = 7; 
z> 2 (24222 +1) 5 

z i: 1- 2i, 


3 


TEST eG -D+H C-DAD 10 
i 


1+2i 
Residue at z = —i: han eee =" 
z= -i 


Z 
z—2)(z¥Ki(z—-i) (=i-2)(=2i) 10 


[answer] 


10.5 Residue theorem 

If f(z) is analytic in a region ʻi except for a pole of order n at z = a and C is a simple 
curve in Ñ containing z = a, f(z) has the form of Eq. (10.13). Integrating the series 
and using the previous result from Eq. (10.15), 


d 0  ifnÆl, 
r) ot ies i (10.21) 
c-a)" Qni ifn=l, 
it follows that 
$ f(z)dz =2mia_}. (10.22) 
C 


Thus, Eq. (10.22) states that the integral of f(z) around a closed path enclosing a 
single pole of f(z) is 27ri times the residue at the pole. 


Example 10.5.1. Prove the residue theorem in Eq. (10.22). 


Solution. 
Inserting the Laurent series in Eq. (10.13) into the integral fe f(z)dz yields 


f feords=§ (otae-o+at-a?.+ i ) a 
c Č 


(z-a) (z-a? ` 


= aolo tajh +a2h +... +a—1ıl-1 +a-2l—2 +... 


E 
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Since J_ is the only nonzero integral in accordance with Eq. (10.21), the final result 
of Eq. (10.22) follows. [answer] 


More generally, the residue theorem states that if f(z) is analytic within and on 
the boundary C of a region ‘t except at a finite number of poles a, b, c... within R, 
having residues a—1, b_1, c_1, ..., respectively, then 


$ f@)dz=2ilas +b bert). (10.23) 
i 


Cauchy’s theorem and integral formulas are special cases of this result. 


Example 10.5.2. Prove the more general residue theorem in Eq. (10.23). 


Solution. 

Let C1, C2, C3, and so on be contours, which each enclose one singularity, that is, C1 
encloses a, C2 encloses b, C3 encloses c, and so on. Furthermore, let C be a contour 
that makes cuts in the contours C1, C2, C3, etc., as depicted in Fig. 10.4. Moreover, 
since the straight-line paths in this figure are traversed in opposite directions, these 
straight-line sections cancel in pairs so that 


G 


C3 


FIGURE 10.4 


Multiple contours, each containing a singularity for evaluation of the residue theorem. 


f rox=¢ soad foa f f (2) dz... 
Cc Cı C2 C3 
=2ria-;ı + 2mib_| +2ric—1 +... 


where a—1, b—1, and c_, are the residues of f(z) at the singularities z = a, z = b, 
and z = c, respectively. Thus, the general result is 


$ f(z)dz = 2xi - sum of the residues of f(z) enclosed by C. [answer] 
C 


ee... Lh 
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The evaluation of various definite integrals can often be achieved by using 
the residue theorem together with suitable functions f(z) and a suitable path or 


contour C. 
27 
30 
Example 10.5.3. Show that f D jp 
o 5—4cos0 12 
Solution. 
zt+z! ei? + e318 


If z in polar form is z = 1 - e!?, then cos@ = 


, cos30 = ——____ = 
2 


z2 +z 
————, and dz = iz d0. The integral therefore becomes 


[ cos 36 w= (z? +z7°)/2 dz Lg ae 4 
o 5—4c080. Ic z4z \ iz aled- DR-) 


-E 


where C is the circle of unit radius with center at the origin (see Fig. 10.5). 


FIGURE 10.5 


Schematic of a circle C of unit radius. 


The integrand has a pole of order 3 at z = 0 and a simple pole z = 1/2 within C: 


1 d* 641 21 
Residue at z = 0: lim Zz . lies = —; 
z> 0 2! dz? pz- 1)(z — 2) 8 


z +1 = 65 
=- 


2241 
Residue at z= 1/2: lim | G27) oy 
z> 1/2 2 23(2z ~1)(z — 2) 


1 z+ 1 1, .f21 65 x . 

Thus, - dz = -(2xi) } — — — + = — by the residue 
2i Je 23(2z — 1)(z — 2) 2i 8 24 12 

theorem. [answer] 


10.6 Applications of conformal mapping 


Consider the analytic function w = f(z) = u(x, y)+iv(x, y). This function provides 
a transformation (or mapping) that establishes a correspondence between points on 


E 
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the wv and xy planes, where u = u(x, y) and v = v(x, y). If there is a mapping of 
curves C; and C2 on the xy plane into curves C| and C on the uv plane, where the 
angle between Cı and C2 remains unchanged in both magnitude and sense with the 
mapping to the curves C; and C4, then the mapping is said to be conformal (Spiegel, 
1964). Moreover, if f(z) is analytic with f’(z) Æ 0 in a region , it follows that 
the mapping w = f(z) is conformal at all points of R. Riemann’s mapping theorem 
explicitly states that there exists a function w = f(z) in the region that maps each 
point of A into a corresponding point of X’, as well as each point of the simple closed 
boundary curve in the z plane of K, into a so-called unit circle in the w plane (which 
forms the boundary of the region i’) (see Fig. 10.6). The correspondence for this 
mapping is one-to-one. 


Zplane w plane 
y v 
e 
c 
x u 
(a) (b) 
FIGURE 10.6 
Schematic for the Riemann mapping from (a) the z plane to (b) the w plane. 


Application to Dirichlet boundary value problems 

Boundary value problems in Chapter 5 require the solution of a partial differential 
equation with associated boundary conditions. Consider a Dirichlet problem, for in- 
stance, that requires a solution of a harmonic function @ of the Laplace equation in 
a simply connected region St’ bounded by a simple closed boundary curve C (see 
Fig. 10.6(a)). The solution function is subject to prescribed values on this boundary 
C. In fact, if C is a unit circle |z| = 1, a function that satisfies Laplace’s equation at 
each point (r, 0) with prescribed values F (6) on C’ (where ¢ (r, 0) = F(@)) in R is 
given by the Poisson formula for a circle (Spiegel, 1964): 


1 f* G=r)F@de 
oie) => | _ 
2 1 — 2r cos(0 — ġ) +r 
Similarly, for a function in the half-plane y > O (that is, Im{z} > 0), with prescribed 


values G(x) on the x axis (that is, (x, 0) = G(x), —co < x < ov), is given by the 
Poisson formula for a half-plane (Spiegel, 1964): 


_1 (®© yG@)dn 
pezi eee. 


es = 
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As an application of conformal mapping, the Dirichlet problem for any simply con- 
nected region % can be solved with a mapping function onto either the unit circle or 
the half-plane as follows: 


(i) Use the mapping function to transform the boundary value problem for the re- 
gion into a unit circle or half-plane. 
(ii) Solve the problem for the unit circle or half-plane. 
(iii) Employ an inverse conformal mapping function to obtain the solution. For 
example, if w = f(z), then there exists a unique inverse mapping function 
z = g(w) in Ñ provided f’(z) £0. 


Example 10.6.1. Determine the harmonic function ¢ (x, y) using the Poison formula 


1, x>0, ; 
on the upper half z plane, where G(x) = | 0 x<0 on the x axis. 
Solution. 

We have 


i if y G(n)dn -1f y[0]dn +f y[l]dn 
do +a- nJoy +a- mM ado y +a- 


i 1 
=1 tan™! (Z). 
T x 
0 


1 —x 
= — tan! 4 


T y 


Application of conformal mapping to engineering problems 
If (x, y) satisfies the Laplace equation 
rp ao 
ERLE ah, 10.24 
əx? dy? ( ) 


then the function ¢ is harmonic. Moreover, there must be a conjugate harmonic 
function (x, y) which is analytic, such that 


Q(z)= (x,y) +iv(, y). (10.25) 


The function Q is called the complex potential. 


Example 10.6.2. Using the Cauchy—Riemann equations for the given harmonic func- 
tions in Eq. (10.25), show that 


dQ ð 0 ð ð 
— = 0'(z) = i w $ i ua 
dz ox ox Ox oy 
Solution. 5 , 3 3 
The total derivatives of ọ and y are dọ = Wix + Ui anddy = D iy + OY ay. 
Ox dy Ox dy 
Hence, 


ana dorian = (504i) ax + (2 41S) ay. 


OOOO. 
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Using the Cauchy—Riemann equations in Eq. (10.2) yields 


dQ= (+i) ars -EE ay 


Ox Ox Ox Ox 
ap a 

= (3 +i) (dx + idy). 
Ox Ox 


Therefore, the result follows: 


dQ, _ (96 , dW) _ (90 _ , 06 
Ba-Wo=(2+i%)-(2 i). [answer] 


If a mapping such that w = f(z) = u(x, y) + iv(x, y) is analytic, the one- 
parameter families of curves in Eq. (10.25) are orthogonal if 


p(x, y) =a and w(x, y) = P, 


where œ and £ are constants. As shown in Fig. 10.7(a), each member of one family 
of curves is perpendicular to the other family of curves at the point of intersection. 
Moreover, the corresponding image curves in the w plane are parallel to the u and v 
axes, which also form orthogonal families in Fig. 10.7(b). Thus, when the mapping 
of f(z) is analytic, the angles between the intersecting curves in the z plane are equal 
in both magnitude and sense to the intersecting image curves in the w plane. This 
result is in fact the underlying basis of conformal mapping. 


z plane w plane 
v: 


FIGURE 10.7 


One-parameter families of intersecting curves in (a) the z plane and (b) the w plane. 


Applications of conformal mappings to various fields of engineering are briefly 
described in the following sections. 


Fluid flow 


The function Q (z) is important to characterize the flow pattern. If Vy and Vy are the 
components of the fluid velocity in the x and y directions, these quantities relate to 


es) = = 
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the velocity potential as 


Y= 2 and V, = oe (10.26) 
ox dy 
Moreover, if a fluid is incompressible so that its density pọ remains constant, the mass 
flux in the x and y directions is Fy = pV, and Fy = p Vy, respectively. In the steady 
state, if there is no accumulation of the fluid in a region C, then the amount of material 
entering the region is equal to the quantity leaving it, and from Eq. (5.8), 


dV, dV, 
— =0. 10.27 
ox oy ( ) 


This latter relation is simply a statement of conservation of continuity. Hence, 
Eq. (10.26) and Eq. (10.27) yield a velocity potential ¢@ that is harmonic and satisfies 
the Laplace equation in Eq. (10.24). Moreover, using the result of Example 10.6.2 
with Eq. (10.26), 

dp ð$ 


Q(z) = ax l ay = Vy i Vy. 


Consequently, the complex velocity is the complex conjugate such that V = V, +i Vy 


with the magnitude V = |V| = ,/V2+ V2. 


Conformal mapping is useful in obtaining the complex potential Q (z) in order to 
characterize the flow pattern. For instance, to account for flow around an obstacle, 
one can map a uniform flow in the w plane given by V,w (see Fig. 10.8(a)) using 
the mapping function w = z + a? /z. Here, the upper half w plane is mapped into the 
upper half z plane that is exterior to the circle C, as depicted in Fig. 10.8(b) (Spiegel, 
1964). 


w plane z plane 


FIGURE 10.8 


Mapping of (a) a uniform flow in the w plane to (b) a flow with an obstacle in the z plane 
(adapted from Spiegel, 1964). 


Other types of mapping functions are given in Spiegel (1964) for different appli- 
cations and problems. 


e 
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Example 10.6.3. The complex potential for a flowing fluid with a solid body im- 
2 

mersed into it is Q(z) = Və | z + a . The constant Vo is equal to the magnitude of 
Z 


the free stream velocity away from the immersed object and a is the radius of the 
object (see Fig. 10.8(b)). 


(a) Determine the equations for the streamlines and equipotential lines in Fig. 10.8 
letting z = re’. 

(b) Describe the flow around the circular object of radius a. 

(c) Find the velocity at any point and away from the object. 

(d) Determine the stagnation points. 


Solution. 
(a) We have 
2 2 2 
Q(z) = Vo (ve + Eee) = Vo (« + =) cos +i Vo (« = £) sind. 
r r r 


——— M eS 
$ 4 


2 
The stream lines are given by w = Vo ( = £) sin = B, which are indicated by the 
F 
curves showing the actual path of the fluid in Fig. 10.8(b). Similarly, the equipotential 
2 
lines are given by ¢ = Vo ¢ + £) cos@ =a, which are indicated by the dashed 
r 


lines that are orthogonal to the streamline family of curves in Fig. 10.8. 


(b) The circle r = a represents a streamline. Since there is no flow across a streamline, 
it represents the circular obstacle. 


2 2 
Q'(z) = Vo (1 — =) =V, (1 = se) 
Z r 
a? F V,a2 . 
=V, 1 — — cos 20 at | -z sin 26. 
r r 


- a? V,a? 
Thus, the complex velocity is V = V, (1 — =z cos 28) —i (=) sin20, with 
r r 


edeli E)n] 


2a? cos20 af 
=V,,/1 +—. 


(c) We have 


magnitude 


es = 
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Thus, far from the obstacle, as r > co, V = Vp. 
2 


(d) For a stagnation point, Q’(z) = 0 so that V, (: — =) =0 orz =a and z = —a, 
Z 


corresponding to points A and B in Fig. 10.8(b). 


Electrostatics 


A charge distribution will establish an electric field. The intensity of this field & can 
be related to an electrostatic potential @ as 


&=—-Vo¢. 
Hence, for a two-dimensional charge distribution, 


a a 
E, = ee d E, = E (10.28) 
ox $ ðy 


In the z plane, using Eq. (10.28), the electric field intensity is 
G@=E,+iEy=—-— —-i—. (10.29) 


Using Stokes and Green’s theorems from Section 4.4.2, along with Maxwell’s equa- 
tion V x E = 0 (Jackson, 1975) with the definition of the vector cross-product, it 
can be shown that the tangential component of the electric field intensity for a simple 
curve C in the z plane is given by 


$ E, ds = § E; dx + Ey dy =0. (10.30) 
Cc Cc 


Similarly, using the divergence and Green’s theorems, with Maxwell’s equation V - 
E =Q (Jackson, 1975) (that is, for the case that there is no charge in any region) with 
the definition of the vector dot product, the normal component of the electric field 
intensity is given by 


f Enas= E, dy — Ey dx =0. (10.31) 
E Cc 


It also follows directly from Maxwell’s equation that with no charge 


dE,  dEy 
— =0. 10.32 
ox oy ( ) 


Using Eq. (10.28) and Eq. (10.32), one obtains the Laplace equation in Eq. (10.24) 
for the electrostatic potential. As such, @ is harmonic for all points not occupied by 
any charge. Consequently, there must be a harmonic conjugate function w, where Q 
is the complex electrostatic potential such that 


Q(z) = h(x, y) +iy x, y). 


a U‘ 
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Thus, from Eq. (10.29) 


ə — 
ox i oy ox ii dy @) ( ) 


Again, equipotential lines and flux lines are given, respectively, by 


(x, y) =a and w(x, y) = $. 
Heat flow 
The heat flux across a surface is given by Fourier’s law in Example 5.1.1: 


Q=-KV¢, 


where ¢ is the temperature and K is the thermal conductivity of the solid material. 
One has a similar relationship for the temperature as for the potential in electrostatics, 
where in two dimensions, 


0 0 
Q=Q,+iOy=-K (24:2), (10.34) 
ox oy 
where for the second relation one has used 
0 0 
dazk and Og (10.35) 
Ox É dy 


Analogously, for a closed simple curve C in the z plane (that represents the cross- 
section of a cylinder), the tangential and normal components of the heat flux for 
steady-state conditions, in which there is no accumulation of heat inside of C and no 
sinks or sources, are 


f Onds= 4 O, dy — Q, dx = 0 and J O,ds=$ O,dx + Qydy=0. 
Cc Cc C Cc 


(10.36) 
The first equation of Eq. (10.36) gives the similar result as for electrostatics: 
dO, ə 
Qs + 3L» o, (10.37) 
ox dy 


Hence, using Eq. (10.35) and Eq. (10.37), one obtains the Laplace equation for the 
temperature where ¢ is harmonic. Again one has the harmonic conjugate function W: 


Q(z) = h(x, y) +iy x, y). 


Similarly, the same families of curves are identified as isothermal lines and flux lines 
for the complex temperature Q (z): 


p(x, y) =a and w(x, y) = $. 


oo OO O 
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Thus, the methodology for solving temperature problems is analogous to that of elec- 
trostatics. 


Summary 


As a summary of results, the functions ¢ (x, y) = œ and w(x, y) = B yield the analo- 
gous quantities in fluid flow, electrostatics, and heat flow, where, respectively: 


(i) For steady fluid flow, these relations give rise to equipotential curves and stream 
curves. The stream curves represent the actual path fluid particles take in a flow 
pattern. 

(ii) For electrostatics, they result in equipotential lines and flux lines. 
(iii) For heat flow, they give isothermal lines and flux lines. 


Problems 


1 
10.1 Expand the function f(z) = ——————— in a Laurent series, which is valid 
(z +1) + 3) 
for 1 < |z| <3. 


Hint: Use partial fractions and a binomial series expansion, (1 +.x)~! = 1 — x +x? — 
x?..., for x values in the interval of convergence of —1 < x < 1. 


10.2 Consider the integral — where C is a circle of radius 1/2 and centered 
at 1, and positively oriented. 
1 1 
z—-1oz-1 ztl 
(b) Evaluate the integral using Cauchy’s integral formula and Cauchy’s theorem 
with these partial fractions, respectively. 


(a) Using a partial fraction method, show that 


10.3 The complex inversion formula in Eq. (3.9) with a branch point s = 0 yields 
Spiegel, 1971 aN \ peg feJ- | Thi 
egel, ISL = er = erfc į ——~ 7. [his trans- 
(Spiegel, 1971): f0) : { | =ere{ “I 
form is used in Example 5.2.11. 
(a) Show that %7! fev} = —* _4-3/2g-a"/(40)_ To find this transform, use the 
2/1 


Laplace transform of a derivative Z { f’(t)} = s F (s) (that is, for f (0) = 0), with the 
eas 


a 
2/t 


10.4 Given the function u(x, y) = x? — y?. 


previous result f(t) = L'{F(s)} = erfc | |: where F(s) = 


(a) Prove that this equation satisfies the Laplace equation. 
(b) Find the function v(x, y) such that f(z) = u + iv is an analytic function. 


10.5 Show that f(z) = sinz is analytic in the following steps: 
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(a) Find the real and imaginary parts of f(z). 


(b) Show that u(x, y) and v(x, y) from part (a) satisfy the Cauchy—Riemann con- 
ditions. 


10.6 Evaluate the integral fc z? dz along the path shown in Fig. 10.9. 


(0 


FIGURE 10.9 


Schematic of a closed semicircle contour C. 


d 

10.7 Evaluate the integral $. Da dz, where C is a circle of radius b centered 
z“—a 

at the origin with b >a. 


+œ dx 
10.8 Evaluate the definite integral r 
-œ (1+x?) 


the x axis from —p to +p, and then closes by a semicircle in the upper half-plane of 
radius p centered at the origin. 


where the contour runs along 


10.9 Show that using contour integration and the residue theorem, the definite inte- 
; —V2n 
{> —el YX dw : i i œ) 3 e for x < 0, Thi 
gral ————~ is evaluated asg (x) = is 
-œ (w?—iw+2) —./ In =e 


e for x > 0. 


3 
result is used in the solution of Problem 3.10. 


CHAPTER 


Nondimensionalization 


A mathematical model or an equation describes the behavior of a real-life system. A 
model is described by dependent and independent variables, and parameters. When 
a system is defined in terms of position as a function of time, the position acts as the 
dependent variable and time as the independent variable. On the other hand, parame- 
ters, in general, can be constants or vary in a model. A variable instead of a parameter 
can be used if the value changes continuously (e.g., over time). A parameter can be 
used also if the value does not change or only changes at particular moments of time. 
A solution is sought to determine the relation between dependent and independent 
variables in a model. 

Variables and parameters used in modeling the behavior of a real-life system rep- 
resent physical properties and hence have physical dimensions. Both sides of an 
equation must have the same dimension (sometimes called the dimensional homo- 
geneity). Wherever a sum of quantities appears, all terms in the sum must have the 
same dimension. 

Modeling a real-life system may become difficult with too many parameters and 
variables. Consequently, techniques are applied to reduce the number of variables/pa- 
rameters while simplifying equations. A dimensional analysis is applied to relate 
parameters/variables with each other while developing a nondimensional expression 
between independent and dependent variables/parameters. Conversely, a nondimen- 
sionalization technique is applied to reduce the number of parameters in a system. 
In both techniques (i.e., dimensional and nondimensionalization analyses), the phys- 
ical dimensions of variables and parameters can be fruitfully exploited, resulting in a 
simplified nondimensional equation or expression. 

Dimensional analysis is used to check if dimensional homogeneity is achieved. 
In addition, with the application of Buckingham’s z theorem (which is a tool for 
dimensional analysis), nondimensional expressions are developed to relate dependent 
and independent variables/parameters. In Buckingham’s x theorem, no definite basic 
equation is defined but rather relationships. 

Nondimensionalization is a technique that is applied to remove, either fully or 
partially, dimensions from an equation by substituting in relevant variables. Nondi- 
mensionalization finds application, in particular, for systems that can be described 
by differential equations. It is employed to: (i) generalize a problem; (ii) simplify 
an equation by reducing the number of variables; (iii) rescale parameters and vari- 
ables so that quantities are of similar order/magnitude; and (iv) analyze the system 
regardless of units used to measure variables. 
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Dimensionless numbers, such as those used in fluid mechanics (for example, the 
Reynolds number, defined as the ratio of inertia forces to viscous forces) and heat 
transfer (e.g., the Prandtl number, defined as the ratio of momentum diffusivity to 
thermal diffusivity), are outcomes of simplifications, such as nondimensionalization 
of differential equations. These numbers can be used to identify the relative impor- 
tance of terms. For example, the Reynolds number will indicate which force (i.e., 
inertia forces or viscous forces) becomes dominant for a given condition of fluid ve- 
locity, kinematic viscosity, and characteristic length. 

Experiments cannot be always performed on one-to-one scale prototypes; exper- 
imental systems need to be scaled down. In addition, sometimes, different fluids are 
used in the modeling of a system for cost saving purposes. Nondimensionalization 
or dimensional analysis techniques need to be applied to facilitate a scale-up of lab- 
oratory results to the actual system (for example, the scaling up of a modeled wind 
tunnel experiment). The use of dimensionless numbers facilitates this scale-up. 

In the following sections, dimensional analysis and Buckingham’s x theorem will 
be discussed, followed by similarity laws and the nondimensionalization technique. 


11.1 Dimensional analysis 


The dimension of a physical quantity can be expressed by the product of basic phys- 
ical dimensions such as mass (M), length (L), time (T), absolute temperature (©), 
amount of a substance (N), electric current (I), and luminous intensity (J). These 
physical dimensions are basic as they can be easily measured in experiments. Di- 
mensions are not the same as units. For example, the physical quantity of speed can 
be measured in units of kilometers per hour or miles per second, but regardless of its 
unit, speed is always defined as a length divided by a time, or simply L/T. It should be 
noted that some quantities do not have dimensions. These quantities include: trigono- 
metric and exponential functions, logarithms, and counted quantities. Despite the fact 
that angles are dimensionless, they have units, such as a degree. 


Example 11.1.1. Determine the dimensions of the following quantities: area, 
(area) !/ 2. (volume)?, density, acceleration, force, pressure, work, kinetic energy, 
charge, and electric field. 


Solution: 


Area: L? (area)!/2: L (volume)?: L? 
Density (= mass/volume): M/L? 

Acceleration (= velocity/time): L/T? 

Force (= mass x acceleration): M-L/T? 

Pressure (= force/area): M/L-T? 

Work (= force x distance): M-L?/T? 

Kinetic energy (= mass x (velocity)?/2): M.L?/T? 


O [IU 
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Charge (= current x time): I-T 
Electric field (= force/charge): M-L/I-T? 


Example 11.1.2. Determine the SI units of the following quantities: density, force, 
pressure, (kinetic) energy, and electric field. 


Solution: 


Density: kg/m? 

Force: kg-m/s? 
Pressure: kg/m-s” 
Energy: kg-m?/s” 
Electric field: kg-m/A-s* 


Example 11.1.3. Determine the dimensions of k in the expressions cos (./k/m - t), 
ett, and 10%, where m is mass, f is time, i is the imaginary unit, and x is length. 


Solution: 


k = M/T? in expression cos (./k/m - t) 
k =T7! in expression e!* 


k =L"! in expression 10‘ 


Dimensional analysis is also applied to ensure the dimensional homogeneity of an 
equation in which both sides must have the same dimension. 


Example 11.1.4. Check the dimensional homogeneity of the equation written for 
pumping power: 


7 1, ; 
Q + W = m(Houtlet — Hintet) + 5 Wouter = VP ler) + mg (Zoutlet — Zinlet), 


where Q is the rate of heat input to the coolant in J-s~!, W is the rate of work input to 
the coolant in J- s7}, m is the mass flow rate inkg-s~!, H is the enthalpy inJ-kg™!, 
v is the velocity in m-s~!, g is the gravitational acceleration, which is 9.81 m - s7? 


and z is the height in m. 


Solution: 
We have 
O +W = h (Houe Ha) + = h Pater — Pater) 
TAM outlet inlet 2 m outlet — Yinlet 
=~ ~e saa amas “sa i 
J J kg L kg m2 
s s s kg s “eos 
E (Zoutlet — Zintet) - 


OOOO. 
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The Joule (J) is defined as the energy transfer to (or work done on) an object when a 
force of 1 Newton (= kg - m - s7? in SI units) acts on that object in the direction of its 


motion through a distance of 1 meter. It can be further expressed as J = kg - m? - s~?. 


Therefore, the units on both sides of the equation are equal and given by kg - m*-s~?. 
Using dimensions of the terms, instead of units, would generalize the application of 
the dimensional homogeneity. Therefore, it can be demonstrated that both sides have 


the dimensions of M- L? - T7?. 


Differentiation of a quantity with respect to another adds the dimension of the 
variable in the denominator. For example, the derivative of position with respect to 
time, (dx/dt), is the velocity and has the dimension of L - T~!. The second deriva- 
tive with respect to time, (d*x/dt*), is the acceleration and has the dimension of 
ler, 

Polynomials of mixed degree with dimensionless coefficients should satisfy di- 
mensional homogeneity. For example, a projectile motion can be expressed as 


y =ax + bx’, 1.1) 


where a and b are constants, y is the vertical displacement with dimension L, and x 
is the horizontal displacement with dimension L. Dimensional homogeneity can be 
satisfied only if the a and b coefficients have the correct units; otherwise if x = 2 
meters, the right-hand side of the equation becomes a(2 m) + b(4 m7) and the left- 
hand side, y, is given in meters. A closer look into the projectile motion (see Fig. 11.1) 
sheds light on how to choose these coefficients. 


A y 


FIGURE 11.1 


Projectile motion. 


es el l..lLLU TL Eee 
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At any time, t, with the known initial launch angle @ and initial velocity ug, the 
projectile’s horizontal and vertical displacement are given by 


x = vot cos(0), (11.2) 


1 
y = vot sin(9) — 58h (11.3) 


If time, t, is eliminated from Eq. (11.2) and Eq. (11.3), then the vertical displacement 
can be expressed in terms of the horizontal displacement as 


& 2 
= tan(0) - x — —~——_.. 11.4 
á @ 2v3 cos? (0) A 
Since g, 0, and vo are constant, the coefficients a and b in Eq. (11.1) become 
a = tan(0), ba- a (11.5) 
2v3 cos? (0) 


The coefficient a does not have any dimension, whereas the coefficient b has the 
dimension of L~!. In this case, the dimensional homogeneity is sustained with both 
sides of the equation having a dimension of L. 

Another application of dimensional analysis is the conversion from one unit to 
another. For example, the inch and cm are both units of length, and | inch = 2.54 cm. 
By dividing both sides by the same expression, one gets 1 = 2.54 cm/1 inch. Given 
that any quantity can be multiplied by 1 without changing it, the expression of 
“2.54 cm/1 inch” can be used to convert it from inch to cm by multiplying it with 
the quantity to be converted. Therefore, 10 inch will be 25.4 cm (= 10 inch x 
2.54 cm/1 inch). 


Example 11.1.5. Convert 50 miles per hour into meters per second (note that 1 mile 


= 1609.344 meters). 

Solution. 

We have 
50 mile 1609.344 meter 1 hour _ meter 
thor 1 mile i 3600 second second 


11.2 Buckingham’s x theorem — dimensional analysis 


Buckingham’s z theorem is key in dimensional analysis. It provides a method for 
computing sets of dimensionless parameters from given (dimensional) variables (and 
parameters), even if the form of the equation is unknown. 

Nondimensionalization is useful only when the equation (for example, a conser- 
vation equation) is known. In many real-world phenomena, the equations are either 
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unknown or too difficult to solve. Experimentation is the only method of obtaining 
reliable information. In most experiments, scaled models! are used due to time and 
cost saving purposes. Experimental conditions and results need to be scaled properly 
so that results are meaningful for the full-scale prototype. 

Unlike the nondimensionalization technique, which is applied to basic (funda- 
mental) equations, the analysis of Buckingham’s z theorem is based on defining 
relationships between variables. The concept is based on the assumption that the 
space of fundamental and derived physical units forms a vector space over the ratio- 
nal numbers. The fundamental units are basis vectors, with multiplication of physical 
units as the “vector addition” operation, and raising to powers as the scalar multipli- 
cation operation. Making the physical units match across sets of physical equations 
can then be regarded as imposing linear constraints in the vector space of the physical 
units. Therefore, a dimensionless variable is a quantity with fundamental dimensions 
raised to the zeroth power (the zero vector of the vector space over the fundamental 
dimensions). 


The significance of Buckingham’s z theorem is that it: 


e allows analysis of a system when little is known about the physics acting on it; 

e allows measurements to be generalized from a particular set-up of the system; 

e describes the relation between the number of variables and fundamental dimen- 
sions; 

e generates nondimensional parameters that help in the design of experiments 
(physical and/or numerical) and in reporting of results; 

e predicts trends in the relationship between parameters; 

e reduces the number of variables that must be specified to describe an event, which 
often leads to an enormous simplification; 

e provides a similarity law index for the phenomenon under consideration so that 
prototype behavior can be predicted from model behavior (discussed in the next 
section). 


Buckingham’s z theorem states: “If there are n variables (and parameters) in a 
problem and these variables (and parameters) contain m basic dimensions (for ex- 
ample M, L, T), the equation relating all the variables will have (n-m) dimensionless 
groups.” Buckingham referred to these groups as I groups. 

The dimensional complete set of variables and parameters is expressed as 


fF (X1, +++ Xn) =O. (11.6) 
This expression can be replaced by the following set of nondimensional groups: 


FU, --- » Hn-m)) = 9, (11.7) 


1 See the next section for similarity. 
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where the I; are dimensionless parameters constructed from the dimensional x; by 
(n-m) dimensionless equations—the so-called IT groups—of the form 

I; = x xa a. (11.8) 
The exponents a; are rational numbers; however, they can always be taken to be 
integers by redefining I]; as being raised to a power that clears all denominators. 

A dimensional matrix D whose rows are the basic dimensions and whose columns 
are the dimensions of the variables can be formed for a system of n dimensional 
variables (with physical dimensions) in m basic dimensions. For dimensionless 
I; = x1 x3 e Xn”, a vector a = [a1 , az, + +- an] is sought such that the matrix-vector 
product Da equals the zero vector. 

Nondimensional II parameters can be generated by several methods, but the 
“method of repeating variables” is discussed here. While using the theorem, the fol- 
lowing steps are applied: 


e List relevant variables/parameters. Clearly define the problem, identify all the vari- 
ables that are important, and determine which is the main variable of interest (that 
is, the dependent variable). Indicate the total number of independent and depen- 
dent variables and parameters, n. 

e Indicate basic dimensions of each parameter. Express fundamental dimensions of 
each of n variables/parameters, and hence determine the number of basic dimen- 
sions, m. 

e Determine the number of I groups, which is expressed by (n-m). 

e Choose repeating variables: While choosing the repeating variables, the following 
considerations need to be taken into account: 

— never pick the dependent variable; 

— never pick variables that are already dimensionless; 

— never pick two variables with the same dimensions; 

— chosen variables must represent all the primary dimensions; 

— choose common variables as they may appear in each of the II groups; 
— choose simple variables over complex ones. 

e Form dimensionless IT groups and check that they are actually all dimensionless. 
Combine repeating parameters into products with each of the remaining parame- 
ters, one at a time, to create the IT groups. 


Example 11.2.1. Investigate the pressure drop in a pipe using Buckingham’s 7 the- 
orem. 


Solution: 
Independent variables and parameters, with their dimensions in capital letters, are 


e pressure drop, [AP], M-L7!.T-?; 
e viscosity, [u], M-L-!.T7!; 
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e density, [o], M-L73; 
¢ length, [/], L; 

e diameter, [D], L; 

e velocity, [v], L-T7!, 


where pressure drop, [A P], is the dependent variable. 


The model is of the form 
S(AP, n, p,l, D, v) =0. (11.9) 


There are m = 3 basic physical dimensions in this equation, i.e., time (T), mass (M), 
and length (L), and n = 6 dimensional variables, i.e., AP, u, p, l, D, and v. Conse- 
quently, there will be 6 — 3 = 3 dimensionless II parameters. The variables / and D 
cannot be chosen as repeating variables at the same time. Both of these variables have 
the same dimension (repeating parameters by themselves should not be able to form a 
dimensionless group) and D is chosen as the repeating variable. Repeating variables 
should be less complex than the prime quantities. In this case, u can also be chosen 
as prime quantity, as it is more complex than the others. Therefore, A P, which is the 
dependent variable of interest, and / and u can be chosen as prime quantities. The 
rest, D, p, and v, are chosen as the repeating variables. Based on this analysis, the 
form of the model is 


Ho = f (1, H2), (11.10) 
where 
To = A P” p@ v8 D“, (11.11) 
Mi = 1! p2h D”, (11.12) 
I = u“! pv D“, (11.13) 


For the dimensionless Io parameter, the dimensional D matrix, where the rows 
correspond to the basis dimensions T, M, and L and the columns to the dimensional 
variables AP, p, v and D, is given by 


AP ọp v D 
T -2 0 -1 0 
M 1 1 0 0 
L -1 -3 1 1 
or 
—2 -1 0 
D=] 1 1 0 O (11.14) 
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The matrix-vector multiplication is shown below: 


-2 -1 0|” 
1 1 0 ojj% |=0 0, 0), (11.15) 
= 3 i es 
a4 
which leads to the following linear equations: 
—2a; —a3=0, 
aj +a =0, (11.16) 


—a, — 3a2 + a3 + a4 = 0. 


Taking the index of prime quantity a; = 1, the other indices become a2 = —1, 
a3 = —2, and a4 = 0. The vector, a, becomes 
1 
gee) 1 (11.17) 
—2 
0 


Substituting the values of the vector a as exponents in Eq. (11.11), 


To = AP! p7!u~*D°. (11.18) 
Rearranging Eq. (11.18), 
AP 
Io = —,. (11.19) 
pv 


For the dimensionless II; parameter, the dimensional D’ matrix, where the rows 
correspond to the basis dimensions T, M, and L and the columns to the dimensional 
variables /, p, v, and D, is given by 


l p v D 
T 0 0 -1 0 
M 0 1 0 0 
L 1 -3 1 1 
or 
0 -1 0 
D=|0 1 0 0l. (11.20) 


n 
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The matrix-vector multiplication is shown below: 


0 -1 0 4 
o ı 0 O||.7]=[0, 0,0, (11.21) 
i =< a. e 
b4 
which leads to the following linear equations: 
—b3 =0, 
bo =0, (11.22) 


bı — 3b2 + b3 + b4 = Q. 


Taking the index of the prime quantity, bı = 1, and with b2 = 0 and b3 = 0, b4 
becomes equal to —1. The vector, b, can therefore be expressed as 


1 
b= i (11.23) 
—1 
Substituting the values of the vector b as exponents in Eq. (11.12), 
Mı =1! pv? D7!. (11.24) 
Rearranging Eq. (11.24), 
l 
M = D (11.25) 


Similarly, the matrix-vector multiplication for c indices (for TI2) is written as 


A 0 =1 0)|" 
1 1 0 Of) [=[0, 0, 0), (11.26) 
= ea ee ae hig 
c4 
leading to the following linear equations: 
—cı — c3 = 0, 
cı +c. =0, (11.27) 


—cy — 3c2 + c3 + c4 =0. 


Taking the index of the prime quantity, cı = 1, the other indices become c2 = — 1 
and c3 = — 1, and c4 becomes equal to —1. The vector, c, can therefore be expressed 


ee = = = = — 
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as 


1 
=] 
e=] (11.28) 


-1 
Substituting the values of the vector ¢ as exponents in Eq. (11.13), 


_ H 
pvD` 


T2 (11.29) 
It should be noted that the term pvD/u, which appeared in Eq. (11.29), is the 
Reynolds number (Re), and the Darcy friction factor, fp, in a pipe flow can be ex- 
pressed in terms of the Reynolds number as fp x 1/Re = 1/(puvD/w). 

Inserting I groups into Eq. (11.10), the relationship between dimensionless pres- 
sure drop and the independent variables, in terms of dimensionless groups, is ex- 


pressed as 
AP l u 
a =~ F\ 
pu D pvD 
or 
AP l 1l 
Ta DR) i 


Eq. (11.30) can also be written as a relationship, rather than as a functionality, as 


AP l 1f 
a C (5) (x) , (11.31) 


where C is the constant of proportionality and k is any number which is a constant. 


11.3 Similarity laws 


In hydraulic and aerospace engineering, fluid flow conditions are tested with scaled 
models to study complex (fluid dynamics) problems where calculations and computer 
simulations are not reliable. Scale models are usually smaller than the final design to 
allow testing of a design prior to building, which in many cases is a critical step in 
the development process. 

While constructing a scale model, an analysis needs to be applied as some pa- 
rameters need to be altered to match the conditions of interest to the prototype. The 
geometry may be simply scaled, but other parameters, such as velocity, pressure, 
temperature, and type of fluid may need to be altered. Similarity is achieved when 
testing conditions are created such that the test results are applicable to the real 
design. Similarity implies a certain equivalence between two physical phenomena. 
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Under particular conditions where certain relationships (in the form of dimension- 
less groups) are established and satisfied, the two cases are considered similar. For 
experimental purposes, different systems that share the same description in terms of 
the dimensionless numbers are equivalent. Two systems with similar dimensionless 
numbers are called similar. Therefore, similarity is achieved between the model tested 
and the prototype to be designed, if all relevant dimensionless parameters have the 
same corresponding values for the model and prototype. 


Similarity between the model and prototype is achieved if the following condi- 
tions are satisfied. 


Geometric similarity: The length dimension L must be ensured. In general, geometric 
similarity is established when a model and prototype are geometrically similar if and 
only if all body dimensions in all three coordinates have the same linear scale ratio: 
each dimension must be scaled by the same factor. Therefore, all of the angles and 
orientations of the model and prototype with respect to the surroundings must be 
identical. 


Kinematic similarity: The velocity at any point in the model must be proportional, 
i.e., the model and prototype have the same length and time scale ratio. Thus, the 
velocity scale ratio will be the same for both. Fluid flow of both the model and the real 
application must undergo similar time rates of change in motion (fluid streamlines are 
similar). Geometric similarity should be met before achieving kinematic similarity. 


Dynamic similarity: The same ratio should be established between resultant forces 
at corresponding locations on boundaries of the model and the prototype system. 
Dynamic similarity is achieved when the model and prototype have the same length 
scale ratio, time scale ratio, and force scale (mass scale) ratio. Therefore, the other 
two similarities should be met before achieving dynamic similarity. 


As shown in Example | 1.2.1, certain dimensionless numbers appear as a result 
of simple analyses and are associated with a particular type of force, such as the 
association of the Reynolds number, Re, with viscous and inertia forces. Therefore, 
to ensure that inertia and viscous forces are in the same ratio in the model and the 
prototype, the Reynolds number must be the same on both. Most of the problems 
require geometric and dynamic similarity. Dynamic similarity exists simultaneously 
with kinematic similarity if the model and prototype force scale ratios are identical. 
These similarities are, in fact, ratios (independent dimensionless groups), based on 
the x theorem. Since Eq. (11.8) is entirely general, it applies to any system, which 
is a function of the same variables. Therefore, a true model can be designed and 
operated with a matching dimensionless dependent II parameter to the prototype, 


(110) prototype = (Ilo) model: (11.32) 


a = 
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if the dimensionless independent II parameters are equal, 


dI ) prototype = (111) model» 


(11.33) 
(Hin —m)) prototype = Men —m) Vnoder 


Example 11.3.1. Consider the following engineering problem: Sea water at 10°C 
close to the surface (atmospheric pressure) will be pumped from location A to B 
using a pipe system, which is horizontal without elevation and form losses. In order 
to pump the water at a speed of 0.5 m/s, the pressure drop needs to be known to 
determine the pump power. Consequently, a model of 1/10 scale with a similar //D 
ratio is built to determine the pressure drop. The water used in the model system is 
fresh water at 25°C. In order to ensure the dynamic similarity between the model 
and prototype, what would be the velocity of the water in the model system and 
what would be the pressure drop in the seawater pumping system if it is found that 
the pressure drop in the model is 50 kPa when the model is built and tested for the 
pressure drop? 


Solution: 

The Buckingham z theorem as described in Example 11.2.1 shows that the pressure 
drop in a pipe can be described (see Eq. (11.30)) with two dimensionless numbers 
(that is, //D and the Reynolds number) and one dependent variable (A P/ (pv’)). 
Since the dimensionless parameters will stay constant for both the model and the 
prototype, they will be used to formulate scaling laws for the model. Geometric sim- 
ilarity, which is the length-to-diameter ratio (1/D), should be satisfied while building 
the model. Dynamic similarity can be satisfied by equating the Reynolds numbers: 


M 


prototype 


model 


The properties of seawater at atmospheric pressure and 10°C are p = 1025.0 kg/m? 
and u = 0.00148 kg/m-s, while for fresh water at 25°C they are p = 997.1 kg/m? and 
u = 0.0008899 kg/m-s. Hence, 


(eee?) (11.34) 


(ea v Or) 
0.00148 model À 


0.0008899 


prototype 


Rearranging Eq. (11.34), the velocity ratio is 


Vmodel __ e= O 


10 =6.18. 
997.1 /\ 0.00148 ) x 


Uprototype 


For the seawater flowing at a speed of 0.5 m/s in the prototype (pipe), the speed of 
fresh water in the model will be 3.09 m/s. 
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Given that //D and the Reynolds number (Re) are the same for both the prototype 
and the model, the nondimensional pressure drop term should be the same, which is 
given as 


(25) -= (25) (11.35) 
p v? prototype ~ pv model ` 


Rearranging Eq. (11.35), 


(2) 
AP protoiype _ prototype _ 2562 _ 9 oz. (11.36) 
AP model (o v?) 9523.8 
model 


For the tested pressure drop of 50 kPa in the model, the pressure drop in the prototype 
will be 1.35 kPa. 


11.4 Nondimensionalization technique 


Unlike the Buckingham z theorem in the dimensional analysis, the form of the equa- 
tion should be known in nondimensionalization. The sizes of certain dimensionless 
parameters indicate the importance of certain terms in an equation, providing possi- 
bilities to neglect terms. Nondimensional equations can be applied to similar systems 
where the only changes are those of the basic dimensions of the system. Proper 
scaling parameters, with a suitable combination of parameters and constants of the 
equations, need to be selected so that the resulting equation can be dimensionless 
and represent the characteristics of the physical domain. While nondimensionalizing 
an equation, nondimensional numbers/parameters (e.g., the Reynolds number) often 
appear. 


One can nondimensionalize a system of equations by taking the following steps: 


— 


. Identify all the independent and dependent variables. 

2. List primary dimensions of all dimensional variables and parameters/constants. 

3. Select characteristic units (that is, scaling parameters) to nondimensionalize vari- 
ables: If x is a variable, then xe is the characteristic unit used to scale the variable 
x, where x, has the same dimension as x. 

4. Replace each of the dimensional variables with a quantity that is scaled relative to 
a characteristic unit. 

5. Divide through by the coefficient of the highest-order polynomial or derivative 
term. 

6. Choose the characteristic unit for each variable to make as many coefficients as 

possible unity. 


re = 
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Example 11.4.1. Apply the nondimensionalization technique to the following first- 
order ordinary differential equation with constant coefficients: 


dx 2 
ree ; (11.37) 


Solution. 


e Identify all the independent and dependent variables: x is a dependent variable 
and ¢ is an independent variable. 

e List primary dimensions of all dimensional variables and parameters/constants: 
[x] = L; [t] = T. 

Note that in order to satisfy the dimensional homogeneity, the units of all the con- 
stant coefficients a, b, and c must be different in order to be added in Eq. (11.37): 
[a] = L - T7}; [b] = T7}; [c] = L7! TI. 

e Select characteristic units (that is, scaling parameters) to nondimensionalize vari- 
ables: A constant value can be selected for the characteristic unit (x¢) as x9 and 
for te as fo. 

e Replace each of dimensional variables with a quantity scaled relative to a char- 
acteristic unit. Given that the objective is to replace the dimensional quantities or 
variables with nondimensional variables, € is chosen as the nondimensional vari- 
able for x, and t is chosen for t. Therefore, the dimensional variable in terms of 
nondimensional variables and characteristic units are expressed as 


x=Exo, t=Tho. (11.38) 
Inserting Eq. (11.38) into Eq. (11.37), 


xod 
NOME yi (bxo)é + (cx0)E?. (11.39) 
to dT 
e Divide through by the coefficient of the highest-order polynomial or derivative 
term. The coefficient of the highest-order term is in front of the first derivative 
term. Dividing by this coefficient gives 


dé ato 2 
— = — + (bto) + (cxoto)é^. (11.40) 
dt XO 
e Choose the characteristic unit for each variable to make the coefficients of as many 
terms as possible unity. By choosing the constant term (i.e., the first term in the 
right-hand side of Eq. (11.40)) and the coefficient in front of & (i.e., the second 
term in the right-hand side of Eq. (11.40)) equal to 1, 


1 ato a 
bto = 1 > t0 = —, —=1>x=-. (11.41) 
b Xo b 


The coefficient of E? (i.e., the last term in Eq. (11.40)) becomes k = ca/b*. 
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Finally, the nondimensional equation can be expressed as 


de 2 

—=14+8 + kE’. (11.42) 
dt 

Instead of three undetermined parameters (a, b, c) in the dimensional version (that is, 

Eq. (11.37)), the nondimensional version has only one parameter, x. 


Problems 


11.1 Determine the variables on which a pendulum’s amplitude (or sometimes 
called the pendulum position) depends (see Fig. 11.2). Hint: Use the Buckingham 
x theorem; the relevant dimensional variables can be listed as pendulum’s mass (m), 
length of the rigid rod (/), pendulum’s period (t), time (t), gravitational acceleration 
(e), and initial position of the pendulum (ġo = ¢ (0)). 


FIGURE 11.2 


Motion of a pendulum. 


11.2 Apply the nondimensionalization technique to the Navier-Stokes equation for 
the x-component; the incompressible Navier-Stokes momentum equation is given by 


ðu 
plaz + (u - V)u] = —V P + uV°u + pg, 
where p is the density, P is the pressure, jz is the dynamic viscosity, u is the flow 
velocity, ¢ is time, and g is the gravitational acceleration. 


11.3 Consider a simplification of the problem in Example 11.2.1 for the pressure 
drop Ap in a straight pipe of diameter D. Here, in an experiment, it is observed that 
with laminar flow conditions in the pipe, the pressure drop is observed to depend on 
three parameters: the distance / along the pipe between the pressure sampling points, 
the velocity v of the fluid, and the fluid viscosity jz. For this problem, using Bucking- 
ham’s zr theorem, determine how the pressure drop varies with the pipe diameter. 
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11.4 As shown in Lewis et al. (2017), fuel swelling in nuclear fuel is caused by 
the presence of fission gases in small bubbles. Consider a gas bubble of radius R 
embedded in a solid medium with no hydrostatic stress, where the pressure p acting 
to expand the bubble is balanced by the surface tension y of the solid (see Fig. 11.3). 
Using Buckingham’s x theorem, determine the dependence of the pressure on the 
bubble radius and solid surface tension. Hint: For this analysis, use the basic physical 
dimensions for the system of force (F), length (L), and time (T). 


Surface 
tension, 7 


pressure, p 


FIGURE 11.3 


Gas-filled bubble in mechanical equilibrium with a solid (adapted from Fig. 4.61 in Lewis et 
al., 2017). 
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Nonlinear differential 
equations 


Nonlinear phenomena vary from extremely short subfemtosecond processes of 
laser—atom interactions to large-scale gravitational clustering of galaxies. Nonlinear 
physics covers a wide range of topics, such as nonlinear processes in fluids, optics 
and high-power lasers, acoustics, geophysics, astrophysics and cosmology, and quan- 
tum systems. In nonlinear systems, the change in the output is not proportional to the 
change in the input. For example, it is not possible to predict accurately long-term 
weather forecasts due to the nonlinearity of the system. Most systems, such as atmo- 
spheric conditions, economics, quantum mechanics, optics, and fluid dynamics, are 
nonlinear in nature, where it is difficult to find a general solution. Nonlinear dynami- 
cal systems may appear chaotic or unpredictable. 

Unlike linear problems where a family of linearly independent solutions can be 
used to construct a general solution through a superposition principle, it is not gener- 
ally possible to combine known solutions into a new solution with nonlinear systems. 
It is possible sometimes to find very specific solutions to nonlinear equations, which 
makes methods of solution or analysis dependent on the problem itself. 

Examples of nonlinear differential equations include: (i) the Navier-Stokes equa- 
tions in fluid dynamics, (ii) the nonlinear Schrödinger equation in optics and quantum 
theory, (iii) the Boltzmann equation in thermodynamics for a nonequilibrium state in 
Hamiltonian mechanics and galactic dynamics, (iv) the Korteweg-de Vries (KdV) 
equation that arises in the theory of shallow water waves, (v) the Vlasov equation in 
plasma dynamics with a long-range Coulomb interaction, (vi) the Landau—Ginzburg 
theory in superconductivity phase transitions, and (vii) the Lotka—Volterra equations 
in biology. 

Nonlinear differential equations are formed by the products of the unknown func- 
tion and its derivatives. Unlike a linear equation, where the variable and its derivatives 
appear only with a power of one, in the nonlinear equation any power different from 
one appears. In addition, the appearance of any function (e.g., log, sin, except for a 
first-order polynomial), which depends on the dependent variable, results in a non- 
linear equation. 


Example 12.0.1. Determine if the equations given below are linear or nonlinear. 


ax x g 0 np 0 ar 3 0 
aa ta t5x=0; a +t- =O; a7 rx =U; 
ot? ðt ðt x ar? 
ox oy 3?x Ox 
—+—=0; —~ +sin(t) = 0; — +sin(x) = 0; 
ot at ar? at 
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3x a Ox 0 
ans at 
Solution: 
We have: 
3x ax 
— +—+5x=0, li : 
JZ + at + 5x inear 
ox 1 3 ; 
BE +-=0, nonlinear because 1/x is not of the first power; 
x 
3?x 3 f 3. 
a2 +x” =0, nonlinear because x” is not of the first power; 
ð ð 
S + = =0, linear; 
3?x : ; 
ae sin(t) = 0, linear; 
t 
Ox ; 3 ae : 
ae + sin(x) = 0, nonlinear because sin is a function of the dependent 
variable; 
a>x Ox 
— +t— =0, linear. 
ət? at 


In general, nonlinear equations are difficult to solve analytically, and numerical 
methods are employed to approximate solutions. In some cases, equations can be 
solved analytically, but due to the availability of powerful computing tools, nonlin- 
ear equations may be chosen to be solved numerically. Nevertheless, it is desirable 
to compare the approximate solution with the exact one if an exact solution exists. 
Analytical as well as numerical methods are applied to find roots of a nonlinear func- 
tion or to find the solution of an initial and boundary value problem of a nonlinear 
equation. Except for some very special functions, it is not possible to find an analyt- 
ical expression for the root from which the solution can be exactly determined. This 
result is true for even some polynomial functions. 

An exact solution for a nonlinear system may exist for a specific case. Some tech- 
niques, such as inverse scattering and Backlund transforms, have been proposed and 
applied to solve analytically a particular type of equation, such as the KdV and sine- 
Gordon equations. Besides analytical solutions, numerical solutions, such as the finite 
difference method, have been applied. In some cases, nonlinear dynamical equations 
are approximated by linear equations (that is, linearization). Linearization may work 
up to some accuracy within some range of input parameters, but some features can 
be lost with linearization. Analytical and numerical solutions to initial and bound- 
ary value problems, specific to nonlinear equations, are discussed in the following 
sections. 

Nonlinear waves are fundamental covering a wide range of phenomena from aero- 
dynamics and hydrodynamics, solid state physics and plasma physics, to optics and 
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field theory, population dynamics, nuclear physics, and gravity. In particular, soli- 
tons are the solutions of a widespread class of weakly nonlinear dispersive partial 
differential equations. They find application in fiber optics, quantum mechanics, bi- 
ology, and magnets. Solitons result from the cancelation of nonlinear and dispersive 
effects in the medium (that is, where the speed of the waves varies according to the 
frequency). 

Since solutions to nonlinear problems are case-specific, it is not possible to dis- 
cuss each and every case and solution. Some well-known nonlinear partial differential 
equations include: 


e The nonlinear Schrödinger equation: 


ol ; 
iW == Vw +kKlwlhy, 


where w is the complex field and x is the wave number. The principal applications 
of the nonlinear Schrödinger equation are: (i) propagation of light in nonlinear op- 
tical fibers, (ii) Bose-Einstein condensates for special cases, (iii) Langmuir waves 
in hot plasma, (iv) propagation of plane-diffracted wave beams in the focusing 
regions of the ionosphere, (v) propagation of Davydov’s alpha-helix solitons re- 
sponsible for energy transport along molecular chains, and (vi) small-amplitude 
gravity waves on the surface of deep inviscid water. 


e The Navier-Stokes equations, 


1 
p (ur +u: Vu) =-VP + eV 'u+ 34V (V u) + p8, 


where u is the flow velocity, P is the pressure, u is the dynamic viscosity, o is 
density, and g is gravitational acceleration. The Navier-Stokes equations may be 
used to model air flow around a wing, the weather, ocean currents, and water flow 
in a pipe. These equations find application in the study of blood flow and in the 
design of aircraft and cars. Coupled with Maxwell ’s equations, they can be further 
used to model and study magnetohydrodynamic phenomena. 


e Reaction—diffusion equations, 
ut = Duxx + Rw), 


where u is the concentration variable, D is the (diagonal) diffusion coefficient 
(scalar matrix) independent from coordinates, and R(u) is the local reaction ki- 
netics. Reaction—diffusion equations are employed in defining systems in chem- 
istry, biology, geology, physics (neutron diffusion theory), and ecology. If the 
reaction term vanishes, the equation represents a pure diffusion process known 
as Fick’s second law of diffusion in Eq. (5.9). With R(u) = u(1 — u), which 
is the Fisher equation, the reaction—diffusion equation is used to describe the 


e E 
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spreading of biological populations. With R (u) = u(1 — u?), which is the Newell- 
Whitehead-Segel equation, the reaction—diffusion equation is used to describe 
Rayleigh—Bénard convection. The reaction—diffusion equation with the reaction 
term defined as R(u) = u(1 — u)(u — a) with O < œ < 1 is the general Zeldovich 
equation used in combustion theory. 


In the following nomenclature, u; and y; denote partial derivatives with respect to 
time, t, and ux, Uxx Wx denote partial derivatives with respect to the space coordinate, 
x. 

In this chapter, solutions are sought for waves which have wide application to 
optics, acoustics, and hydrodynamics. Several analytical methods, followed by some 
numerical solutions, are discussed in the following sections. Of particular importance 
is the KdV equation, which is a hyperbolic partial differential equation (see, for ex- 
ample, Section 6.2.3), where solutions of hyperbolic equations are “wave-like.” If a 
disturbance is introduced in the initial data of a hyperbolic differential equation, the 
disturbance is not felt at every point of space at once. Relative to a fixed time coordi- 
nate, disturbances have a finite propagation speed and travel along the characteristics 
of the equation. Unlike a hyperbolic equation, a perturbation of the initial or bound- 
ary data of an elliptic or parabolic equation is felt at once by essentially all points in 
the domain. 

The KdV equation is particularly important as a prototypical example of an ex- 
actly solvable nonlinear system (that is, completely integrable infinite dimensional 
system). The KdV equation describes shallow water waves that are weakly and 
nonlinearly interacting!, ion acoustic waves in a plasma, long internal waves in a 
density-stratified ocean, and acoustic waves on a crystal lattice. In addition, it is the 
governing equation for a string in the Fermi-Pasta-Ulam-Tsingou problem in the 
continuum limit. The KdV equation is one of the most familiar models for solitons 
and it is an important foundation for the study of other equations. The KdV equation 
can be expressed in different forms, some of which are in the nondimensional form, 
as shown below: 


(a) Ut — UU, + Uxxy =O, 
(b) Ut + 6uuy + Uxxx = 0, (12.1) 
(c) ut + (1 + u)ux + Uyxx =Q, 


where u is the average nondimensional velocity. Due to the dispersive term (that is, 
of third order), waves decay while waves will steepen due to the nonlinear term. 


1 Assume compound waves with two peaks, the taller to the left, both propagating to the right. The taller 
wave moves faster than, catches up on, and interacts with the shorter wave. It then moves ahead of it. 
In this case, the interaction is not linear. Unlike the linear interaction where the two waves satisfy linear 
superposition, in the nonlinear interaction, the waves are phase-shifted. 
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12.1 Analytical solution 


In this section, analytical solutions to nonlinear initial and boundary value problems 
are discussed. The most common analytical solution that is applied to many exactly 
solvable nonlinear systems is the “inverse scattering transform.” Another method is 
the “traveling wave solution” for waves maintaining a fixed shape and advancing in 
a particular direction. 


12.1.1 Inverse scattering transform 


The inverse scattering transform is a nonlinear analog of the Fourier transform (see 
Section 3.2). In general, this method can be applied to solve many linear partial dif- 
ferential equations. Unlike the direct scattering method, where the scattering matrix 
is constructed from the potential, in the inverse scattering method, the potential is 
recovered from its scattering matrix. To be more precise, the time evolution of the 
potential is recovered from the time evolution of its scattering data. 


For example, in the case of the KdV equation, the following steps need to be 
applied in order to determine the time evolution of the flow velocity u(x, t) from the 
initial condition given by u(x, 0): 


1. Find the Lax pair consisting of two linear operators L and M for the nonlinear 
equation of interest. 

2. Determine the initial scattering matrix S(A, 0) at t = O from the initial potential 
u(x, 0) by solving the direct scattering problem, using the first operator L. 

3. Determine the time evolution of the scattering matrix S(A, t) at time f, using the 
second operator M. 

4. Determine the potential u(x,t), which is the sought-after solution to the nonlin- 
ear equation of interest from the scattering data S(A,t) by solving the inverse 
scattering problem. 


The concept of the inverse scattering transform is demonstrated in Fig. 12.1. 
The application of the method for the four steps depicted in Fig. 12.1 is detailed 
below. 


1. Lax pair for the nonlinear equation. 
A derivation of the Lax pair, which recovers the nonlinear equation (12.1)(a), is 
developed as follows. Thus, linear operators L and M are defined such that they 
satisfy the original equation, which in this case is the KdV equation. The operators 
L and M depend on an unknown function, u(x,t). The first linear differential 
operator, L, may depend on x, but not explicitly on t; it describes the spectral 
(scattering) problem to establish the scattering matrix at t = 0 as 


Lé=Ad, (12.2) 


where ¢ is the eigenfunction and å is the time-independent eigenvalue. 


a 


334 CHAPTER 12 Nonlinear differential equations 


Determine Scattering Matrix at t=0; S(A,t=0) 


Known: 
Initial Conditions 4 Direct scattering for linear ordinary | (using the first equation in the Lax pair): 
Potential at t=0; differential equations at t=0 Scattering Matrix include: 
u(x,0) discrete eigenvalues, norming constants, 
transmission and reflection coefficients 


Time evolution of scattering data with time 
independent spectral paramater (or eigenvalue): A,= 0 


Determine Scattering Matrix for t > 0; S(A,t) 


Inverse scattering for linear ordinary | time evolution of the scattering matrix 
differential equations for t > 0 (using the second equation in the Lax pair): 

Time evolution of : norming constants, 

transmission and reflection coefficients 


Solution 
fort >0: u(x,t) 


FIGURE 12.1 


Diagram showing the steps involved in the inverse scattering transform. 


The operator M describes how the eigenfunction evolves in time, satisfying the 
following equation: 


Op 

at 
For consistency, these equations can be solved for Ld¢/dt by taking the time 
derivative of Eq. (12.2). The resultant equation becomes 


Mọ. (12.3) 


OL ap OA ad 
L = A—. 12.4 
ot oe ot ot or ot ( ) 
Rearranging Eq. (12.4) using Eq. (12.3), 
OL Or 


Since the spectral parameter (or eigenvalue) à is independent of time (that is, 
01./dt = 0), the Lax equation is expressed as 


OL 

ree ea (12.6) 
which is a nonlinear partial differential equation for u(x,t) due to the nonzero 
commutator LM — ML of the two nonconstant operators. In addition, [L, M] = 
LM — ML is a multiplicative operator. 
Let L be the Schrédinger operator, 


bee ey 12.7) 


Eee 
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The M operator becomes 


Mat 4 + 6u— +3—. (12.8) 


The M operator, along with the Schrödinger operator, Eq. (12.7), satisfies 
Eq. (12.6), and hence u satisfies the KdV equation, in the form of 


25 12. 
ot Oe Yaa wee) 


Inserting Eq. (12.7) and Eq. (12.8) into Eq. (12.2) and Eq. (12.3), respectively, the 
Lax pair can be rewritten as 


2 


ips" eee. (12.10) 
ax2 


Ah I~ ə 
g e= t3 are t3 
The Lax equation, Eq. (12.6), with its pair, Eq. (12.7) and Eq. (12.8), is the com- 
patibility condition for the equation of interest, that is, the KdV equation. In the 
Lax equation, L; = ə L/ðt is the time derivative of L, where it explicitly depends 
on t. With the combination of the appropriate Lax pair, the Lax equation recovers 
the original nonlinear partial differential equation. The suitability of the Lax pair 
for the KdV equation is demonstrated below. 

Inserting Eq. (12.7) and Eq. (12.8) into Eq. (12.6), 


u 
d. (12.11) 
Xx 


aL ðu ae a ðu a 
= =e eer 6 ee f 12.12 
a ar | TUTUT T | ales, 
where the right-hand side of the equation is equal to — (LM — ML) = (ML — 
LM), with 
a 3? du 3? 3u a 
LM = —4 10 15 12 6u? 
ax? T “9x3 a ax Ox? T ( 3x2 >” ) ax 
+ gH gu” (12.13) 
z Su |}, ‘ 
ax3 ax 
a5 as du 3? 3u a 
ML=-4 10 15 12 6u? 
ax? F “9x3 g ax Ox? a ( ax >” ) ax 


aru ou 


e E 
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Finally, 


3u ðu 
[L, M]= (LM — ML) = | ——~ + 6u— |. (12.15) 
əx? Ox 


Using the commutator operator [L, M], one recovers the original KdV equation 
ð 
in Eq. (12.9), where = =[L, M]. 


2. Direct scattering — determine the initial scattering matrix S(A, 0) at t =0. 
For a given potential, u(x), the problem is to find the spectrum of the linear oper- 
ator L (that is, a set of admissible values for à) and to construct the correspond- 
ing functions (x; A). The first Lax equation represents the time-independent 
Schrödinger equation: 


Z" _up=-—ìġ. (12.16) 


Here, A is a constant eigenvalue, ¢ is an unknown eigenfunction of t and x asso- 
ciated with the eigenvalue A, and u(x, t) is the solution of the KdV equation that 
is unknown except at the time t = 0. 

It is assumed that u — 0 sufficiently rapidly as x — +00 so that the term 
(ud) vanishes. Hence, as follows from Eq. (12.16), xx ~ —Ad@. This latter equa- 
tion provides an asymptotic solution for ¢, which is a linear combination of 
exp(+i/2.x) as derived from Section 2.2.2. Consequently, @ decays exponentially 
at infinity if A < 0, and oscillates sinusoidally at infinity if A > 0. Therefore, solu- 
tions are sought for two different spectra: (a) a discrete spectrum (bound states), 
where à < 0, to obtain discrete eigenvalues (A or x) and the norming coefficients 
of the eigenfunctions, Cn, and (b) a continuous spectrum, where 4 > 0 for deter- 
mination of reflection and transmission coefficients. 


(a) Discrete spectrum (bound states) at t = 0: With A <0,« = J-i > 0. 

If the potential u(x, 0) is negative near the origin of the x axis, the spectral prob- 
lem implies the existence of a finite number of eigenfunctions ¢ = n(x, à), n = 
1,---,N, where N is the number of bound states. The discrete spectrum has the 
following characteristics: 


° An = —K? < 0, K1 > KQ++* Kn n= l, ++, N; 
e the spectrum is nondegenerate with a one-to-one correspondence of A, with 
Pn; 


e min{u(x,0)} <A1 <A2 <--- <0; 
e eigenfunctions n that are square integrable are used for the normalization, 
lonl? = foo, pdx = 1. 


The asymptotic behavior will be consistent with the Schrödinger equation, xx ~ 
—ìġ, at |x| — oo, where u vanishes, if 


n(x, 0) ~ Cn (0) e™™* as x—> o. (12.17) 


eee 
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Here C,,(0) are the norming coefficients at t = 0. 


(b) Continuous spectrum at t = 0: With A > 0, « = Jk > 0, all solutions of Q in 
Eq. (12.16) are sinusoidal as x — too. Therefore, for all à > 0, eigenfunctions 
are bounded but not square integrable. The solution can be expressed as a super- 
position of an incident wave, with reflected and transmitted waves, as shown in 


Fig. 12.2. 
u 
Incident 
= e ikx 
ee et Reflected 
S ; > R(k,t) el 


FIGURE 12.2 
Scattering by a potential. 


For k = V/A > 0, a solution is sought for 


T (k, 0) e~#** as x — —00, 


e kX + R(k, 0) el as x > œ, 


x(x, 0) ~ (12.18) 


where T (k, 0) is a transmission coefficient, R(k, 0) is a reflection coefficient, and 
k is the wave number. Note that the functions R(k, 0) and T(k, 0) are not inde- 
pendent, which can be expressed by the total probability relationship 


IT? +|RP = 1. 


(c) Scattering data at t = 0: 

Finally, from the analysis of discrete and continuous spectra, the scattering data 
are expressed in terms of discrete eigenvalues, Kn, norming coefficients, Cn (0), 
transmission coefficients (T (k, 0)), and reflection coefficients (R(k, 0)): 


n=l? 


SO, 0) = (fen, CuO, R(K, 0), Tk, 0)) (12.19) 
3. Time evolution of the scattering data S(A, t). 
(a) Discrete spectrum at t > 0: 


With the potential u(x,t) vanishing at x — oo, the second operator, M (see 
Eq. (12.8)) in the Lax pair reduces to 


a3 
M=—-4—. (12.20) 


OOOO 
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Inserting Eq. (12.17) into Eq. (12.3) with M defined in Eq. (12.20), 


3 


0 3- A, 3 —KnX 
pth) = Mp, t) = 4z Ce nX — AOC, (t eT, (12.21) 


Given that 
dC,(t) 
dt 
the time evolution of the norming coefficients, C, (t), is 


= 4K} Cn (t), 


C, (t) = Cr (O) et! (12.22) 


The norming coefficients used in the Marchenko kernel for the inverse scattering 
transform are 


C!) = (Ch (t)? so that C!) = C0) eSt, (12.23) 
as follows from Eq. (12.22). 


(b) Continuous spectrum at ¢ > 0: 
In this section, the derivation of the time evolution of the continuous spectrum is 
discussed briefly. The details can be found in Drazin (1985) and Koelink (2006). 


The eigenfunctions defined in Eq. (12.18) for x — oo can be rewritten as 
(x,t) = Are + Bae, (12.24) 


with A(O) = 1 and B(O) = R(k, 0). Inserting Eq. (12.24) into Eq. (12.3) with M 
as defined in Eq. (12.20) for x — oo yields 


bi (x, t) = Mok (x, t) = —4(—ik) A(t) eT — 4(ikP Bel. (12.25) 


Here A(t) =e"! and B(t) = R(k, 0) e“, where 
R(k, t) = R(k, 0) 8", (12.26) 
Similarly, rewriting the eigenfunction defined in Eq. (12.18) for x — —on, 
d(x, t) = Cite + Ditye™, (12.27) 


where C(0) = 0 and D(O) = T (k, 0). Inserting Eq. (12.27) into Eq. (12.3) with M 
defined in Eq. (12.20) for x — —oo gives 


Čao, t) = M(x, t) = 4k PCA) ee! — 4(-ik PDA ee". (12.28) 


A 
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Here C(t) = 0 and D(t) = T (k, 0) e74, where 
T(k,t) =T(k, 0). (12.29) 
Eq. (12.29) indicates that the transmission coefficient is time-independent. 


(c) Scattering data at t > 0: 

Finally, from the analysis of discrete and continuous spectra at t > 0, the scatter- 
ing data are expressed in terms of discrete eigenvalues («,,), norming coefficients 
(C,,(t)), transmission coefficients (T (k, t)), and reflection coefficients (R(k, t)): 


SA, t) = (ten, Cn (O}A i, RK, t), T (k, 1) ’ (12.30) 


4. Inverse scattering transform. 
The following steps are followed to recover the potential, u(x, t), for t > 0 from 
the scattering data, S(A, t), in Eq. (12.30): 


e The scattering data are used in the Marchenko kernel: 


N 
1 se 
F(y,t)= =| dkR(k, nel +) Cage. (12.31) 
—00 


n=1 


e The Marchenko integral, 


CO 
Kany Fe ty. + | dzK (x,z,t)F(y +z,t) =0, x<y<wm, 


x 


(12.32) 
is solved to obtain a solution for K (x, y, t). 
e The potential, u(x,t), is then obtained by employing the relation 
OK (x,x,t 
ua, p= 2 (12.33) 
x 


A special case is the reflectionless potential, where R(k, t) = 0 with N = 1 in 
Eq. (12.31). This case results in a soliton with an amplitude 2x? that propagates to 
the right with the velocity 4x7: 


u(x, t) = —2«?sech? [k (x — 424 — xo) | , (12.34) 


Moreover, the solution of the KdV equation corresponding to a reflectionless po- 
tential with N-solitons can be represented by a superposition of N single-soliton 
solutions propagating to the right (see Drazin, 1985), 


N 
u(x,t) ~ — Y` 2k2sech? |en (x = 4424 — xn)| . S0, (12.35) 


n=1 
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where the phase lag, xn, is given by 


=] 2 
1 Cn (0) TF [Kn — Km 

EEN , 12.36 

an Ge af 2k, H Kn + Km on 


n=1 


In the KdV equation, the discrete spectrum gives N solitary waves, whose am- 
plitudes, —2«?, are proportional to their velocities, such that the bigger waves move 
away faster to the right. The continuous spectrum gives the dispersive wave compo- 
nents of the solution. Dispersive waves travel to the left and spread out where the 


amplitudes die away. 


12.1.2 Traveling wave solution 

A traveling wave moves in space, where the traveling wave variable can be defined 
as E = x — ct, where c is the constant wave velocity. Substituting this solution form 
into the partial differential equations gives a system of ordinary differential equations 
known as a traveling wave equation. The permanent wave solution in the form of 
u(x,t) = f(E) is sought when applied to the KdV equation in the form 


ou ðu m au -o 
ot ax ax3 


The function f(&) and c should be determined by substituting the assumed form of 
solution into the KdV equation, yielding an ordinary differential equation for f (E) as 


(12.37) 


—cf' —6ff' +f" =0, (12.38) 
where f’ is defined as 
fis 
f = gf): (12.39) 
Note 
af afa af ə ə 
ar atar 0E a oE 
and 


af af ag əf er) 
dx d& Ox dé’ ax 3E 


The integration of Eq. (12.38) gives 
—cf—3f?°+ f”—A=0, (12.40) 


where A is the integration constant. Multiplying Eq. (12.40) by f’ and integrating the 
resultant equation yields 


F PUT Af =B, (12.41) 


a = 
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where B is the integration constant. This equation can also be interpreted as the “con- 
servation of energy,” where 1/2 ( f' M represents the kinetic energy and —1/2cf? — 
f? — Af represents the potential energy. For the potential, critical points occur if 
—cf —3f* — A = 0 in Eq. (12.40). The discriminant (that is, b? — 4ac) is c? + 12A. 
If c? + 12A <0, then the potential energy monotonically increases, and the traveling 
waves are not bounded. In the case of c? + 12A > 0, there are two equilibria. 

This equation can also be written as 


1 n2 3 1 2 = 

a) Sf bated +Af+B=F(f). (12.42) 
This is a first-order ordinary nonlinear differential equation, and the roots of the cubic 
F(f) equation (see the right-hand side of Eq. (12.42)) are important and can be 
identified (see Fig. 12.3) as: 


1. three distinct real roots (see Fig. 12.3 (C)); 

2. two real roots, one of which is a double root and the other one is a simple root 
(see Fig. 12.3 (B) and (D)); 

3. one real and two imaginary roots (see Fig. 12.3 (A) and (E)); 

4. one root of order three (see Fig. 12.3 (F)). 


(A) (B) 


ny 
Bs 
= 
E 


(E) (F) 


nm) 
-h 


FIGURE 12.3 


Roots of the function F (f) for different cases (reproduced from Drazin, 1985). 


Depending on the cases of interest (e.g., cnoidal or solitary waves) that define the 
roots of the equation, some solutions to Eq. (12.42) arise as discussed below. 


1. In the case where the function F(f) has one root of order three (see Fig. 12.3 
(F)), f = —c/6 and the exact solution of Eq. (12.42) is 


E O 
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c 2 
(os. (12.43) 
2 (E — £o)? 
Note that, in this case, the solution is not bounded. 
2. In the case where the function F(f) has three distinct real roots (see Fig. 12.3 
(C)), defining cnoidal waves, the solution is obtained using the Jacobian elliptic 
function. Conversely, the frequency of the cnoidal wave is described as 


w = kc 


=—2k(fi+ fot f), 


where fi, f2, and f3 are the roots of the function F (f). The solution is obtained 
as 


kx — ot 

ue.n=fO=F( i ). (12.44) 

3. In the case where the function F(f) has two real roots, one of which is double 
and the other one is simple (see Fig. 12.3 (B)), this is an example of a solidary 
wave, where f(&) decays rapidly. In the case of solidary waves, the boundary 
conditions can be defined as f, f’, f” —> 0 as E — ov, resulting in A = 0, B = 0, 
and [1/2(f’)* = f?(f + 1/2c)]. Consequently, 


_ f a _ df 
s=) F =) TIFE 
and the solution is 
fE =—5sech? [£ é- J 


or 


c 2 
u(x,t) = 7 sech 


[£e ct J (12.45) 


The maximum of u lies at x = ct, leading to an amplitude of c/2. The width is 
inversely proportional to ./c. 


12.2 Numerical solution 


Numerical methods can be applied to: 


e find the roots of a nonlinear equation (these methods include, but are not limited 
to, (i) interval halving (bisection method), (ii) linear interpolation, (iii) Newton’s 
method, (iv) quotient difference, and (v) the secant method (see Section 9.1); 

e solve nonlinear initial and boundary value problems. 


D 
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12.2.1 Roots of a nonlinear function 


Newton’s method, as discussed in Section 9.1.2, is one of the most widely used tech- 
niques to find roots of a nonlinear equation. Other techniques can be found in books 
specialized in numerical methods. 


12.2.2 Nonlinear initial and boundary value problems 


One of the common methods that is applied to solve a nonlinear differential equation 
is the finite difference method (see Chapter 6). In this section, the application of the 
finite difference method is discussed for the KdV equation. The first and third deriva- 
tives in the KdV equation can be approximated by the Taylor series as presented in 
Chapter 6. The scheme that is described below was proposed by Zabusky and Kruskal 
(1965). It uses the central difference technique. The subscript i represents the space 
variable, and the subscript j represents the time variable, with u; į = u(i Ax, j At). 


Using the central difference, the time derivative of u can be approximated by 


_ Mi, j+1— Yi, j-1 
= ——_ O 


12.46 
2At ( ) 


Using the central difference, the first space derivative of u can be approximated by 


_ Uil, j — 4i-1,j 


12.47 
2Ax ( ) 


x 


The nonlinear term in the KdV equation can be approximated by taking a local spatial 
average of u (instead of simply u;) as 


ma Ui+l,j + ~ + Mi-1,j f (12.48) 


Using the central difference, the dispersive term, which is the third derivative of 
u, can be approximated by 


Ui+2,j — QUi+1,j + 2ui—1,j — Uj-2, j 


12.49 
2Ax3 ( ) 


Uxxx = 


Inserting the discretized forms into the KdV equation given by Eq. (12.1)(a), the 
explicit leapfrog finite difference scheme (Zabusky and Kruskal, 1965) is expressed 
as 


At 
Ui j+1 =Ui,j-1 + rE (ui+1,j + Uj, j + Ui Lj) (ui t1,j — Ui 1j) 


At 
-Ka (ui+2,j — 2ui+1,j + 2ui—1,j — Ui-2, j). 


(12.50) 
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The truncation error is of order (O (At)? + O(Ax)®). With u > —u, the linear sta- 
bility requirement for this scheme (Taha and Ablowitz, 1984) is 


1 
(Ax)? 


2 
< . 
3/3 


Other schemes and methods (that is, implicit schemes and finite Fourier transform 
or pseudospectral methods) that were proposed to solve the KdV equation are also 
discussed in Taha and Ablowitz (1984). 

The KdV equation defines the waves traveling in one direction (i.e., right-moving, 
+ x direction). In the derivation of the KdV equation, mass (i.e., continuity) and mo- 
mentum conservation equations as well as appropriate boundary conditions are used. 
Similarly, waves moving in both directions (i.e., left and right) are described by the 
Boussinesq equation. Adopting the formulation given by Whitham (1974), the mass 
and momentum equations, which are discussed next, can be solved separately for a 
more robust numerical stability. 

The continuity equation is given by 


t 
2 
ae ug + 


on ð = 
a tag tel =o (12.51) 


and the momentum equation is given by 


ðu ðu am _ | 3u 
at dx Ox 3 0x2dt’ 


(12.52) 


where y is the wave amplitude, x is in the units of ho, which is defined as the undis- 
turbed depth of water, and ż is in the units of ho/co. Here co is the speed of gravity 
waves as defined as ./ghg. Also, g is the acceleration constant of gravity and u is the 
average velocity over a plane normal to the wave propagation direction. 


Based on the numerical scheme suggested by Peregrine (1966), the following 
steps are applied: (1) solve the continuity equation to calculate the amplitude for 
an advanced time step, (2) using the amplitude from the previous step, solve the 
momentum equation to determine the average horizontal velocity, and (3) substitute 
the average horizontal velocity from the previous step into the continuity equation, 
to obtain a new corrected amplitude. With the time steps denoted by j, the space 
steps by i, and the uncorrected amplitude by *, the discretization scheme is defined 
as follows: 


1. Discretize the continuity equation to obtain the uncorrected amplitude for an ad- 
vanced time step, 


Ni j+1 T ij ae Witt, j +Ui-1,; 
a [a ee) | 

o Ni+,j —Ni-1,j 

Tuj 2Ax 


(12.53) 
=0. 


ao OO O 
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2. Discretize the momentum equation to obtain the average horizontal velocity for 
an advanced time step, 


“ijti TMi oy Ui+l,j+1 — Ui-1,j t Ui+l,j — Ui-l,j 


At k 4Ax 
Nipi j+ T i j4 + Mitjo Mi- (12.54) 
4Ax f 
Luigi jt — Ui jt + Uii, j+ — Midi, j + 24i j — Ui, j 
3 Ax?At f 


3. Discretize the continuity equation to obtain the corrected amplitude for an ad- 
vanced time step, 


Ui+1,j+1 — Ui—1,j+1 F Ui+l,j — Ui—l,j 

4Ax 

Ni+1,j ~ Ni—1,j 
2Ax 


Ni, j+1 — i,j 
At 


+ (1+ 7i,;) + 
(12.55) 


1 
= (ui j+1 + uij) =0. 


2 
This scheme, with discretized initial and boundary conditions, was applied to predict 
the height of waves in a tubular test section (see Onder, 2004). 
The analytical and numerical methods discussed in this section can be used as a 
basis to seek solutions to other nonlinear problems. 


Problems 


12.1 Discuss the finite difference solution(s) to the Navier-Stokes equations. 
12.2 Obtain the solution for the KdV equation given in Eq. (12.45) by assuming 
a traveling wave of the form u(x,t) = f(E), where E = (x — ct — x). Solve this 


problem with direct integration by explicitly separating the differentials and using 
antiderivatives. Use Maple to solve the resulting integral that arises in this derivation, 


lor 


ð ð 
12.3 Show that the classical advection equation = + c = 0, with u = u(x,t), 
x 
can be reproduced with the Lax operator pair 
32 
L= ax2 = ul, 
ð 
M=-c—, 
ox 


where Z is the identity operator. 


a a 
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12.4 Classify the following equations as linear or nonlinear: 
(a) un = uxx, 
(b) o pu; =V-(K(u) Vu), where o and p are constants, 
© w= f(u,t), 
(d) V2u=Asin(u), where A is a constant, 


Ju a 
O ~44 5, 
X 


12.5 Consider the nonlinear ordinary differential equation 
w(t) = au (t) + bun), 


where a and b are constants. The time derivative for this equation can be discretized 

as 

Un+1 — Un 
At 

where u, and u,+, are the values of the function u(t) at the respective nth and n + Ist 

mesh points. 


u(t) = 


’ 


(a) Whatis the iteration scheme for this nonlinear equation using the forward Euler 
discretization method in Eq. (6.4)? 

(b) Consider a Crank—Nicolson discretization method for this ordinary differential 
equation with a central difference approach given by 


Un+1 — Un 2 
art Sauny +b [un] : 


1 is approximated by a standard arith- 


What is the resulting iteration scheme if u,,, 1 
2 


metic mean u, 1 = z [un + un+1] but linearized with a geometric mean for the 


any) 
nonlinear term [ena] ~ Un Unt? 
12.6 A well-known model for population growth involves the nonlinear ordinary 
differential equation 
u =a(u)u, u(0)=1, 

where the growth term is given by 

u Nr 
aw=o(1- 3) 


and r is a constant. Here the growth rate factor a(u) ~ p allows the population to 
grow initially with unlimited access while the population u is small but tails off where 
a(t) approaches zero as the population u approaches a maximum M that the environ- 
ment can sustain. 


oo OO O 
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The standard Crank—Nicolson approach for the iteration scheme for this differential 
equation from Section 6.2.2 would be 


Un+1 — Un 


1 
A = z [a Unun +a(un+1)ün+1]. 


However, this equation is nonlinear. Thus, linearize the iteration scheme using a ge- 
ometric mean as presented in Problem 12.5. 


CHAPTER 


Integral equations 


In previous chapters—for example, Chapter 2 and Chapter 5—differential equations 
arise that involve an unknown function with one or more of its derivatives. An inte- 
gral equation can also occur involving an unknown function that is contained within 
an integral. Differential equations provide a means to describe physical laws in engi- 
neering that arise in chemical kinetics, heat and mass transfer, and electric circuits, 
to name a few. The differential equations can be transformed into equivalent integral 
equations. Moreover, integral equations also arise in the modeling of complex be- 
havior of phenomena such as radiation and diffraction in optics, seismic responses 
of dams, harmonic vibrations in material, unsteady heat transfer, and eddy currents 
in turbulent fluid flow (Rahman, 2007). These problems in many cases require a nu- 
merical solution. Important representations can also arise as an integro-differential 
equation, which occur with radiative transfer of energy in absorbing, emitting, and 
scattering media, as well as in neutron transport theory. 

Various solution methods for these equations are presented in Section 13.1.1. 
There is a close connection between differential and integral equations. Integral equa- 
tions may in some cases offer a more powerful method of solution as compared 
to differential equations since the boundary conditions are built into the integral 
equation rather than imposed as the last step of the solution method. As shown, for ex- 
ample, in Section 13.1.2, values on the boundaries in integral equations are contained 
in a kernel. This methodology naturally leads to the Green’s function in Section 13.2. 
Here the Green’s function is in fact an integral kernel. 

The Green’s function can be used to solve a large family of problems such as 
ordinary differential equations with initial or boundary value conditions and inhomo- 
geneous partial differential equations with boundary conditions. The Green’s function 
method has wide applications in physics and engineering, including quantum field 
theory, areoacoustics, seismology, statistical field theory, electrodynamics, and me- 
chanical oscillators, to name a few. 


13.1 Integral equations 


The integral equations can be broadly classified as Fredholm and Volterra integral 
equations, depending on the nature of the limits of integration. Following Arfken et 
al. (2013) and Press et al. (1986), integral equations can be classified in two ways: 
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e The type of integral equation depends on whether the limits of integration are 
fixed or variable. The Fredholm equation has constants as integration limits. The 
Volterra equation has one limit that is variable. 

e The integral equation can be further classified as a “first kind” if the unknown 
function only appears under the integral sign or as a “second kind” if the function 
appears both inside and outside of the integral. In addition, they can be classified 
as homogeneous or nonhomogeneous integral equations, leading to several cases 
as follows (BYU, 2020; Rahman, 2007): 


(a) Fredholm’s integral equation of the first kind: 
b 
/ K(x, y)u(y) dy = f(x), Ku= f (nonhomogeneous), 
a 
b 
/ K(x, y)u(y) dy =0, Ku=0 (homogeneous); 
a 
Fredholm’s integral equation of the second kind: 
b 
u(x) = a f K(x, y)u(y)dy + f(x), u=AKu+ f (nonhomogeneous), 
a 
b 
uc) =a f K(x, y)u(y) dy, u=AKu_ (homogeneous). 
a 
(b) Volterra’s integral equation of the first kind: 
X 
[ ke nuordy= reo: 
0 
Volterra’s integral equation of the second kind: 
x 
u(x) =i) K(x, y)u(y) dy + f(x). 
0 


(c) Integro-differential equations include an unknown function under the integral 
sign and also any derivative of the unknown function: 


du 
Haut f Ko. yuo rdy + £00. 
x G 
which is applicable over the domain space G = (a, b). 
(d) Singular integral equations where the limits of the integral are infinite or the 
kernel K becomes unbounded such that 


[0,6] 


u(t) dt, 
[0,6] 


u(x) = fay +2 f 


sos [MOO ar, 0<p<1. 
o (x—1)F 


ee) 
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In these integral equations, the dependent variable, u, is an unknown function, and the 
kernel, K(x, y), f(x), and the parameter A are assumed to be known. The equation 
is homogeneous if f(x) = 0. 


The integral equations include an integral term in the form of an integral operator 
with the kernel K (x, y), such that 


b(x) 
Ku =] K(x, y)u(y) dy. 


The Fredholm equations above are shown along with a simplified form using an in- 
tegral operator. 


Existence and uniqueness of solution 


As detailed in BYU (2020), the existence and uniqueness of a solution for a Fredholm 
equation of the second kind, 


u=AKu+ f, (13.1) 


can be determined as follows. Here, K is a bounded operator that satisfies the Lip- 
schitz condition ||Ku, — Ku2|| < k||u, — u2|| for k > 0, where || f ||c = max| f (x)| 
and C is a vector space contained in the complete space R”. One can further rewrite 
the integral as 


u=Tu, (13.2) 


where T is the operator: Tu = AKu + f. The Banach fixed point theorem can be 
used to guarantee the existence and uniqueness of fixed points of self-maps, where, 
in this application, u is a particular fixed point of T. This theorem in fact provides an 
abstract formulation of Picard’s iteration method. This latter method is described as 
a solution method for integral equations below. Thus, the fixed point theorem can be 
used to ensure that the integral operator has a unique fixed point. Using the Lipschitz 
condition, one can show that 


[Tui — Tug|| = AKu: + f —AKu2 + f) 
= ||AKuı —AKug|| = |Al||K (u1 — u2) 
<|AlA|[Auy — u2ll. 


Moreover, if |A|k < 1, then the operator T is a contraction. As such, from the Banach 
fixed point theorem, there exists a unique fixed point for Eq. (13.2). This fixed point 
is also a solution of the Fredholm integral equation in Eq. (13.1). Thus, the Fredholm 
equation of the second kind with a bounded kernel has a solution for a value of 
A <I/k. 


Typical applications of integral equations and integro-differential equations that 
arise in engineering are presented as follows. 


OOOO. 
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Example 13.1.1. As a generalization of Hooke’s law (see Example 2.2.5), viscoelas- 
ticity is an important phenomena that arises in materials engineering to describe the 
behavior of rubber, plastics, and glass, to name a few. The viscoelastic constitutive 
relations for the stress ø (t) as a function of the strain rate €(f) can be expressed as a 
Volterra type integral equation: 


a(t) =0+ f EU mean, 


where o, is a constant and Æ is a relaxation modulus. 


Example 13.1.2. Several models have been proposed over the many years to de- 
scribe population growth. A more general model that describes the rate of change 
of the population with time dP/dt is given by the following overall balance equa- 
tion. Here the population is assumed to grow proportionally to the current population 


cı P. Unlimited growth is restricted with a loss term —c2 P? and an additional term 
t 


for accumulated toxicity since time zero, —c3 P P(x)dx. Thus, c1, c2, and c3 are 


0 
constants where c4 is a birth rate coefficient, c2 is a crowding coefficient, and c3 is a 
toxicity coefficient. The resultant Volterra type integral equation for this model is 


dP 
dt 


t 
=c\P cP? ap | P(x)dx. 
0 


This integral equation is subject to the initial condition P(t = 0) = Py. 


Example 13.1.3. In nuclear engineering in the design of nuclear reactors, complex 
neutron transport phenomena can be modeled with reactor physics considerations 
employing the general Boltzmann transport equation. The integro-differential form 
of the time-dependent neutron transport equation is (Lewis et al., 2017) 


1 d 
—+V-24 %,(r, E,t) + Lar, E,t)| or, Q, E,t) = S(r, E, Q, t) 
Vel (E) dt 
oo 
, , , x(E) , i 
F Us(r, E, HSQ- Q, E >E) + y Vee et 
0 4r TT 


x or, X, E’, ÐdV'dE'. 


Here a neutron of energy E’ and direction Q’ will be scattered into energy E and 
direction &, where the differential scattering kernel fs accounts for the fractional 
probability of scattering. The function g is the neutron angular flux of the neutron at 
the position r, direction &, energy E, and time t. The speed of the neutron is ve; (E). 
The quantities Xs, X4, and & p are the macroscopic cross-sections for scattering, ab- 
sorption, and fission, respectively. The parameter x (E) is the probability of a neutron 
released from fission having energy E and v(E’) is the number of neutrons emitted 
per fission initiated with a neutron of energy E’. The overall source term S also al- 
lows for a production of neutrons by radioactive decay of certain isotopes and nuclear 
reactions. 


ee 
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Example 13.1.4. Solve the following Volterra integro-differential equation: ul" (x)= 


Xx 
x coshx — f t u(t)dt, subject to the initial conditions u(0) = 0 and u’ (0) = 1. 
0 


Solution. 
[0,6] 


Substituting a power series (see Section 2.5) u(x) = X anx" and a Taylor series 


© 2k is 
expansion for coshx = > x into the integro-differential equation yields 
k=0 (2k)! 
n—2 _ 
2 1)anx =x (È Sa) - f: Dar dt. 


Applying the initial condition u(0) = 0 in the power series expansion gives u(0) = 
[0,0] 
ag = 0. Similarly, u’ (x) = > nax”! implies u'(0) = a; = 1. Expanding the given 


n=1 
series produces the following terms with powers of x: 


2. la +3-2a3x +4-3a4x? + 5 - 4asx? 
l x2 x4 x 1 4 
=x + 71 + z See 3 hac SRETEN E 


Comparing each of the terms with various powers of x” yields a2 = 0, a3 = 


Br’ 
1 1 
a4 = 0, a5 = 3P and so on. More generally, a2, = 0 and a2n+1 = Gn 4D! forn > 0. 
x3 5 
Thus, the solution is u(x) = x + 31 + $I +...=sinhx. [answer] 


13.1.1 Solution methods of integral equations 


A solution of an integral equation is any function u(x) satisfying 


u=Ku+ fÍ, nonhomogeneous equation, 


u=AKu, homogeneous equation. 


The value of the parameter à for which the homogeneous integral equation has a 
nontrivial solution is called an eigenvalue of the kernel K (x, y), and the correspond- 
ing solution is called an eigenfunction of this kernel. One can distinguish eigenvalue 
problems for the integral kernel (integral equation) 


u=hKu 


e 
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and for the integral operator 


1 
Ku=-u. 
À 


Here the eigenvalues of the integral operator are reciprocal to eigenvalues of the in- 
tegral kernel, whereas the eigenfunctions are the same in both cases. 


General methods of solution are summarized below from BYU (2020) and Rah- 
man (2007). Solution methods used for the Fredholm equation include techniques 
such as successive approximation, Neumann series, and successive substitution us- 
ing the resolvent method. Successive approximation can also be employed for the 
Volterra equation. 


Successive approximation (Fredholm equation) 


The nonhomogeneous integral equation u = AKu + f can be solved iteratively using 
a method of successive approximations, such that 


uo(x) = f(x), 
Un(x) =AKuy_-1 + f (n= 1,2...). 


This methodology yields the solution 


Un(x) = XOAK f, K¥ = K(K(...K)). 


TEE aS 
k=0 


k times 


n 
The resultant series >> AŻ KÝ f is called a Neumann series and is convergent for 
k=0 


|A| , where M is a bounded number, that is, M > 0, such that for (x, y) 


1 
< — 
M(b—a) 
belonging to the interval [a, b], M = max |K (x, y|). The nonhomogeneous equa- 
x,ye[a,b] 
tion u=AKu-+ f can be equivalently written in the form (J —AK)u = f, where I 
is an identity operator. Hence, a solution can be obtained using an inversion operator, 


such that u = (I — 1K)~! f. The inverse operator exists if |A| < ————-. 
M(b—a) 


Example 13.1.5. Find the solution to the integral equation 


1 
way =e +2 uod 
e Jo 


Solution: r 
By inspection, K(x, y) = 1, f(x) = e*, b — a = 1, à = —, and M = 1, where the 
e 


1 
convergence criterion is satisfied: |A| < ———, that is, - < —. 
M(b—a) e 1-1 


D Sooo 
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This problem can be solved either (i) with an iterative approximation method or 
(ii) by a Neumann series. 


(i) Iteration method: We have 
Uo (x) == e~ , 


1 f! 1 f! 1 
TORTESJI wo(yydy =e += f edy=e*+1—--, 
é Jo e Jo e 


1 f! 1f! 1 1 
mo=e+ f mody= 41-i f (41-2) dyme' 41-5, 
e Jo e e e 


0 


1f! 1 
u(y e+e | Un—-1(y)dy =e* +1—-——. 
0 


en 
The solution is finally obtained by taking the limit of the nth iteration: 
u(x) = lim u(x) =e* +1. [answer] 
noo 
The solution is confirmed with substitution into the integral equation. 


(ii) Neumann series: 
Expanding the Neumann series 


SOK f= f(x) HAK! f HRK? ft... 
k=0 


The individual terms of the series are evaluated as 


1 
xref edy=e-l, 
0 


1 
aa (e—1)dy=e-l, 
0 


K" f=e-1. 


Inserting these terms into the Neumann series yields 


T je e piat eih 
e e g 


=A Ge Cs e a OSs 
e e p: 


e 
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zt (e—1) 

=e*—(e-1)+(e-l) -= -e +14 : 
ETA T= 

e 


=e*—e+l+e 


=e*+1. [answer] 
This result is the same as that obtained by the iteration method in item (1). 


Successive substitution — resolvent method (Fredholm equation) 


If the integral operator K has a continuous kernel, then one can have an iterated kernel 
K” = K (K""!) =(K""!) K (with n = 2, 3...). Now consider the given terms for 
the repeated operator: 


Kip = f KO WFO), 
Goa” 


K'(x,y) 
(K2 f(x) = IKKA) = I K(x,y) | I KOLDO) ay | dy! 
= f | f KESIR. ay | Toa 
G G 


K? (x,y) 


In general the iterated kernel K” (x, y) is 
K"(x,y) -=f K(x, y) K”! O", y)dy' = f K”! O, y) Kx, y)dy’. 
G G 


The resolvent is a function defined by the infinite series 


OO 
R(x, y,à) = se ae aaa es y). 
k=0 
Hence, the solution to the integral equation u = 7Ku + f with continuous kernel 


1 
K(x, y) for |A| < M(b—a) is 


b 
wey = faa f R(x, y,4) f(y) dy. 


a 


Example 13.1.6. Find the solution to the integral equation 


ree 
u(x) = 6" + 8 i x yu(y)dy using the resolvent method. 
0 


O O [— o SF = =e 
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Solution: 


23 
By inspection, K(x, y) = xy, f(x) = ey b-a=1,rA= 


1 
convergence criterion is satisfied: |A| < =-< : 
M(b-a) 8 1-1 


1 
3” and M = 1, where the 
1 


The iterated kernels are 


K'(x, y) =xy, 
1 1 137! 
y xy 
Rey =f Kl y)KO' y)dy = f syyyay =a% | ==, 
0 0 3 Jo 3 
1 1 1 137! 
K? (x, = K(x, y')K iyay'= f Syyay = 2/2] -3 
«=f (x, y)K(y’, y)dy p BY 3 31, 2 
xy 
BO ae 
The resolvent is evaluated as 
OO 
R(x, y, A= > AK, y) 
k=0 
l xy 1 xy 1 xy 1 xy 
Fern gg part gas pron” 
1 1 1 1 1 1 1 1 
Oe gga gn ga aa Wa ag 
1 24 
=x = 
y = 73 y 
24 


Hence, the final solution is 


b 23 1 (124 23 
wo) = soa f Rey) FOdY= Gx + 5 f xy —ydy 


1 


eel bale ee [ | 
= — x r A E = 4x. answer 
A a kere 


Successive approximation (Volterra equation of the second kind) 
The Volterra integral equation of the second kind, 


Te af ET E e), 


OOOO 
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can be solved using successive approximations, such that 
ug(x) = f(x), 


n 
un(x) = OMEN f= AKu + f. 
k=0 


This methodology yields the solution 


u&x) = Sx (x*f) (x), 
k=0 


Example 13.1.7. Find the solution to the integral equation 


x 
uay=1+ f uoy 
0 
using the method of successive approximations. 


Solution: 
By inspection, K (x, y) = 1, f(x) = 1, à = 1, and M = 1. 


The iterated kernels are 
K’ f = f&œ)=1, 
K'f=f Key (KF) = f 1: 1dy = [y =x, 
0 0 
x x 274* 2 
ep | KG, K! ay= ['t-ydy=[] =5 
fap Ken(K's)ordy= pf teydy=|5) =5 


x a J i 
3p 2 = ss = sd 2 
Kf=f K(x, y)(K A)oray= f i o= i 


n 


K' f=. 
n! 
22 oo yk ; 
Hence the solution is u(x) = > à% (K* f) œ=}, Pi e*. [answer] 
k=0 k=0 £- 


Laplace transform method 
The convolution theorem for Laplace transforms in Section 3.1.4 can be used to 
specifically solve an integral equation of the form 


x 


wo) = foy-+3 f K(x —t)u(t)dt, 
0 


ee = 
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where f(x) and K(x) are known functions. Applying a Laplace transform to this 
equation gives 


L{uy = Lf} + AL{K}L{u}. 


it TESE 
Solving for L {u}, Z {u} = IZK} and taking the inverse transform, the solu- 
tion for u(x) follows: 
— Lf) 
_ 1 
icc {sao} 


Example 13.1.8. Solve the equation u(x) = x — ie (x —ft)u(t)dt. 


Solution: i 
Here f(x) =x, K(x) =x, and à = —1. From Table 3.1, {x} = =: 
s 


Thus, making use of the inverse transform from Table 3.1, the solution is u(x) = 


gal {| = gZ! lal =sinx. [answer] 


13.1.2 Integral/differential equation transformations 


A differential equation can be recast as an integral equation or, vice versa, an integral 
equation can be converted into a differential equation. As mentioned, integral equa- 
tions provide an important and powerful method of solution in place of differential 
equations since the boundary or initial conditions are built into the integral equation 
rather than imposed as the last step of the solution method. This section demonstrates 
the connection of integral equations with boundary and initial value problems. 


Reduction of an initial value problem to a Volterra equation 


Example 13.1.9. Reduce the following initial value problem to a Volterra integral 
equation: u’ — 3x? u = 0, with initial condition u(0) = 1. 


Solution. 
Integrating the differential equation from 0 to x yields 


[ [won-30? won] ar=0. 


[wray-3 f orway=o. 
0 0 


Recognizing the antiderivative from the fundamental theorem of calculus for the first 
integral gives 


u(x) — u(0) — 3 f oway =0. 
0 


E -|-| | | 
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Imposing the initial condition u(0) = 1 gives the final result for the Volterra equation: 
x 
u(x) = 143 f y u(y) dy. [answer] 
0 


Here the kernel to the differential equation is K (x, y) = y?. 


Reduction of the Volterra equation to an initial value problem 


Reduction of the Volterra equation to an initial value problem involves consecutive 
differentiation of the integral equation with respect to the variable x and substitution 
of x = 0 for the setting of the initial conditions. 


Example 13.1.10. Reduce the Volterra integral equation u(x) = x? + te (x — 
y)? u(y) dy to an initial value problem. 


Solution. 
Substitute x = 0 in the integral equation to obtain the initial condition 


0 
u(0) = 0° +f (x — y)? u(y) dy =0. 


a 
0 


Differentiating the integral equation, using the Leibnitz rule for differentiation of an 


integral, 
d P% bœ) Jg(x, y) db(x) da(x) 
= gœ ydy= | EY! dy + glx, bN EE — gix, ay, 
XxX a(x) a(x) Ox dx dx 


gives u'(x) = 3x? + [} 2(x — y)u(y)dy, which implies u/(0) = 3 - 0? + IN 2(x — 
y)u(y)dy =0. 


Differentiating again gives 
x 0 
u” (x) =6x + af u(y)dy, which implies u” (0) = 6 - 0 + 2 f u(y)dy =0. 
0 0 


d 
Differentiating one more time, using the simplified Leibnitz rule — te. g(y)dy = 


g(x), gives the final result with a third-order ordinary differential equation, 


u” — 2u =6, with boundary conditions u(0) = 0, u'(0) = 0, and u’”(0) = 0. 


[answer] 


D 
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Reduction of a boundary value problem to a Fredholm equation 


Example 13.1.11. Reduce the boundary value problem y”(x) + y(x) = x to the 
Fredholm integral equation. Here x belongs to the interval (0, 7) with boundary con- 
ditions (i) y(0) = 1 and (ii) y(r) =x — 1. 


Solution. 
Letting y” (x) = u(x) and integrating gives 


f y'Odr= f wea 
0 0 


y'(@) — y'0) = Í eer 


so that 


Integrating again yields 


x x t2 
/ [y'(a) — yO] dt = Í p uaydn |an. 
0 0 0 


Carrying out the integration for the term on the left-hand side of the equation, 


x th 
y(x) — OEEO -=f p uoan | dtr. 


Using the repeated integration formula 


x th t3 t 1 x 
T f al / f(t) dtidtz...dty_\dty = a) (x— n”! f(t)dt 
o Jo 0 Jo (n— 1)! Jo 


for the right-hand side of the equation gives 
x 
y(x) — yO) — y'O) x = Í (x —t)u(t) dt. 
Employing the first boundary condition in the above equation produces 
x 
y(x)-1l—-y'O)x= Í (x—t)u(t)dt. 


However, y’ (0) is not known. This quantity can be determined by applying the second 
boundary condition with x = z in the previous equation and then solving for y’(0): 


yr) —1—y'(0)x z. (x —t)u(t)dt 
0 
so that 


ya)=na-1l= l+yor+ f (x — t)u(t)dt. 
0 


OOOO. 
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Therefore, 


; 2 1f 
y (0)=1 I (x —t)u(t) dt. 
T Iv 0 


Substituting y’(0) into the expression for y(x) provides the result: 


as =x zf Dudi f œ- Duid 
T T Jo 0 


Thus, substituting in this expression for y(x) and y” (x) = u(x) into the original dif- 
ferential equation yields 


u+fi+x = = [a nucar+ f -ouod =x 
TT T JO 0 


or 


EF 1+3 f a Dudi- fo Hundt 
TT T 0 0 


2 “rx x f” 
=Żx-1+ f [ke -n--p]uars= f (x —t)u(t) dt 
o Lx T Jy 


2 * t(w —x) ™ x(a —f) 
=—x- 1+ f Tuara f —— u(t) dt. 
T 0 T x T 


2 
Hence, the Fredholm equation is u(x) = — x — 1 + iy K (x, t) u(t) dt, with the kernel 
x 


t 
i. if O<t<x, 
K(x,t)= T 
SRT a poe 
Xx TT 
5 < 


Picard’s iteration method 


As discussed in Section 2.1.2, the first-order ordinary differential equation y’ = 
f(x,y), with the condition y(x) = yo, can be recast on integration as an integral 
expression: y’ = Yo + S f(t, y(t)) dt. As demonstrated in Section 2.1.2, this inte- 
gral equation can be solved using Picard’s iteration method. 


13.2 Green’s function 


As mentioned in Section 13.1, integral equations provide a means to incorporate 
the boundary conditions into a kernel. In fact, the nonhomogeneous Sturm—Liouville 
equation of Section 2.5.4 can be written as 


Ly(x) + f(x) =0, (13.3) 


D 
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where X is a self-adjoint differential operator acting on y(x), where 


d d 
La (w=) + q(x). (13.4) 


As a differential equation, Eq. (13.3) is solved for y(x) at specific boundary con- 
ditions at the endpoints in the interval [a, b]. However, as shown by Arfken et al. 
(2013), Eq. (13.3) can be recast into an integral equation where the solution y(x) 
involves a Green’s function G(x, t): 


b 
y(x) af G(x, t) f (t)dt. (13.5) 


The Green’s function in Eq. (13.5) is the kernel of the Fredholm equation of the 
first kind given in Section 13.1. It has the important symmetry property through a 
reciprocity principle that G(x, t) = G (t, x) (Arfken et al., 2013). The boundary con- 
ditions have been specifically built into the Green’s function G(x, t) via 


1 
-e t ’ < t, 
z“ ) a<x< 


G(x, t)= (13.6) 
T7“ Or, t<x<b, 


where u(x) and v(x) are linearly independent and each satisfies the homogeneous 
Sturm-—Liouville equation (that is, with f(x) = 0) as well as the given boundary 
conditions at x = a and x = b, respectively. Moreover, the Wronskian of these two 
functions (see Eq. (2.56)) requires W ¥ 0 for linear independence, thereby satisfying 
“Abel’s formula” 


A 
u(x)v (x) — v(x)u' (x) = re (13.7) 


for this Sturm—Liouville problem, with A being a constant. 


The problem in Eq. (13.3) and Eq. (13.4) can in fact be generalized further to the 
Sturm—Liouville eigenvalue problem of Eq. (2.178): 


Ly (x) + Ap(x)y(x) = 0. (13.8) 
This latter equation is a homogeneous Fredholm equation of the second kind. Since 


there are no specific restrictions on f(x) in Eq. (13.3), one can assume that f(x) = 
Ap(x)y(x) so that the solution of Eq. (13.8) simply follows from Eq. (13.5) as 


b 
yoy =a | G(x, t) p(t) f@adt. (13.9) 


a 
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Example 13.2.1. Consider the linear oscillator problem 
y"(x) +Ay(x) =0 (13.10) 
with the boundary conditions y(0) = y(1) = 0. 


Constructing the Green’s function, one needs to find solutions of the homogeneous 
Sturm—Liouville equation -Z y(x) = 0 or y” (x) = 0. As mentioned, to satisfy bound- 
ary conditions, one solution for the Green’s function must vanish at x = 0 and the 
other at x = 1. Thus, the functions u and v have a linear form ax + b in order to 
satisfy y” (x) = 0. Imposing the required boundary condition on each solution yields 
the (unnormalized) functions 


u(x) =x, 
PrE (13.11) 


Using Eq. (13.7), with r(x) = 1 for Eq. (13.10), we have uv’ — vu’ = —1 so that 
A = —1. The Green’s function follows from Eq. (13.6) (see Fig. 13.1): 


x(_-t), O<-x <t, 
G(x,t)= (13.12) 
t(l—-x), t<x<l. 


G(x,t) 


x(1-t) t(1-x) 


>x 


FIGURE 13.1 


Green’s function for a linear oscillator. 


Thus, with p(x) = 1 and f(x) = Ay(x) from Eq. (13.10), the solution using 
Eq. (13.9) is 


1 
yaaa f G(x, t)y(t)dt. (13.13) 
0 


The known solution of the differential equation in Eq. (13.10) is obtained from the 
result for the separated-variable function X (x) in Example 5.2.3 (with L = 1): 


y(x)=sinVax (A=n?x?, withn = 1,2, ...). (13.14) 


The exact solution in Eq. (13.14) does in fact satisfy Eq. (13.13). This result can 
be seen by noting the symmetry of G(x, t) from the reciprocity principle (that is, 


re = 


13.2 Green’s function 365 


exchanging x and ¢ in the Green’s function of Eq. (13.12) given that the dummy inte- 
gration variable is ¢ in Eq. (13.13)). Thus, inserting this symmetrical function G (t, x) 
along with the exact solution of y(t) = sinnzt into Eq. (13.13) and integrating yields 
the solution function y(x) in Eq. (13.14). This result therefore shows that the inte- 
gral solution of Eq. (13.13) is in fact a solution of the Sturm—Liouville problem in 
Eq. (13.10). 


Application of the Green’s function for nonhomogeneous problems 
Consider the Helmholtz equation in three dimensions with general spatial coordinates 


= 


r: 
V-u(r) +k uf) =0, (13.15) 


where k is a constant. This equation also reduces to the Laplace equation in the limit 
that k approaches zero (see Chapter 5). The equation has to be solved in a volume V 
subject to boundary conditions specified on the bounding surface. 

If the right-hand side of the Helmholtz or Laplace equation (see, for example, 
Eq. (13.15)) is zero, then the equation is said to be homogeneous. On the other 
hand, if the right-hand side of the equation is a function of position coordinates, it 
is nonhomogeneous. Such is the case for the Poisson equation as described in Chap- 
ter 5. The term on the right-hand side of the equation is sometimes called a driving 
term. 

Consider the nonhomogeneous Helmholtz equation with a general driving term 


f(r): 
Vu) + kuf) = f (7). (13.16) 


The method of solution for Eq. (13.16) involves the use of the Green’s function and 
is solved in the following steps (Towner, 2020; Boas, 2006): 


(i) Solve the equation 
VGE, r) +h GE, r) =5F,1), (13.17) 


subject to the same boundary conditions as for Eq. (13.16). The function ô (F, r! ) 
is the so-called Dirac delta function. 
(ii) Then, the general solution of Eq. (13.16) is 


ue) = fff GE, r) f ENT, (13.18) 
V 


where the integration is over the volume V under consideration. 
Here the Dirac delta function in Eq. (13.17) of step (i) has the following properties: 


(a) (x) = 0 for x £0, 
(b) f(a) = [R f(x)8(x —a)dx, 


e 
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where f(x) is a well-behaved function, which is integrated over the point x =a. A 
special case of the property in item (b) is when f(x) = 1 and the function is integrated 
over the origin (that is, a = 0) so that | es 6(x)dx = 1. This function can also be 
integrated over a finite domain such that 


fa), b<a<c, 


J OPSE | 0, otherwise. 


The proof that Eq. (13.18) is a solution of Eq. (13.16) is given as Problem 13.4. 


Most of the effort to find the solution in Eq. (13.18) is determining the Green’s 
function in Eq. (13.17) for the given geometry and boundary conditions of interest. 
However, once this function is found, it can be used for many driving functions f (7) 
provided that the geometry and boundary conditions are not changed. 

a 
Example 13.2.2. Solve the second-order ordinary differential equation “> t+y= 
f(x), subject to the boundary conditions y = 0 at x = 0 and x = 7/2, using the 
Green’s function method. 


Solution. 
One needs to determine the Green’s function from Eq. (13.17) such that 


a 
— G(x, x!) + G(x, x’) = d(x — x’). (13.19) 
dx? 

When x = x’, the right-hand side of Eq. (13.19) is equal to zero. In this case the dif- 


2 
s : : y c 

ferential equation reduces to the homogeneous equation ——> + y = 0, which from 
Section 2.2 has the solution as a linear combination of cosx and sinx. If we drop 
the function cosx at x = 0 and sinx at x = 2/2, the boundary conditions are au- 
tomatically satisfied. Hence, one expects the Green’s function to take the following 
form: 
; A(x’)sinx, x <x’, 

G(x,x ) = 

B(x) cosx, x>x. 

The functions A(x’) and B(x’) need to be determined. At x = x’, the Green’s function 
needs to (i) be continuous and (ii) have a finite discontinuity in its slope at x = x’, 
which can be determined from Eq. (13.19). Condition (i) gives 


A(x’) sinx = B(x’) cosx. (13.20) 


The second condition follows on integrating Eq. (13.19) from x = x’ — € and x = 
x’ + €, and then letting € approach zero: 


x’ Fe dG 7 x'+e x! be 
f east | Gorx')dx= f 5(x — x’) dx. 


2 
x'—e dx x/—€ x’-€ 


UU ll eee 
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Thus, integrating, noting that the second term on the left-hand side of the equation 


f me G(x, x’)dx tends to zero as € — 0 since G is continuous, and using the prop- 


erty of the Dirac delta function for the term on the right-hand side of the equation 
yields 

dG \x'+e 

dx xe 


Substituting G(x, x’) into this latter expression gives 


d / d à z 
— B(x ) cosx — | — A(x )sinx =1 
dx x!+e dx x!+e 


or 
— B(x’) sinx’ — A(x’) cosx’ = 1. (13.21) 
Solving Eq. (13.20) and Eq. (13.21) gives A(x’) = — cos x’ and B(x’) = — sin x’, so 


that the Green’s function is determined as 


i —cosx’sinx, x <x’, 
G(x,x)= 


—sinx’cosx, x>x’'. 


For the second step, using Eq. (13.18) and the Green’s function, the solution for y(x) 
that satisfies the boundary conditions is 


m/2 
y(x) -f G(x, x’) f(x)dx" 

x m/2 

=) (— cosx sin x’) f (x^) ax + f (—sinx cosx’) f(x)dx’ 
0 X 
x m/2 

= -cosx | sinx’ f (x^) dx’ — sinx f cosx’ f(x’) dx’. [answer] 
0 


x 


The methodology of Example 13.2.2 can be generalized for the following second- 
order ordinary differential equation: 


y" + p(x)y’ + q(a)y = f(x) 


in the range a < x < b and with boundary conditions y(a) = 0 and y(b) = 0. 
The first step is to find the Green’s function, which must satisfy 


2 
£ GG, x+ a ow + q(x)G(x, x") = d(x — x’), (13.22) 
dx dx 


and the two boundary conditions so that G(a, x’) and G(b, x’) = 0 when x Æ x’. 
Let yı (x) and y2(x) be the two independent solutions of the homogeneous equation 
y” + p(@x)y’ + q)y =0. 


E 
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Similarly, one can postulate a solution of the Green’s function of the form 


G(x x’) = A(x')yi x), a<x< x’, 
; B(x')y2(x), x! _— <b, 


which satisfies the boundary conditions. Again the functions A(x’) and B(x’) at 
x =x’ need to ensure that the Green’s function (i) is continuous and (ii) has a fi- 


nite discontinuity in its slope at x = x’ that can be determined from Eq. (13.22). 
Similarly, the first condition yields 


A(x’) yi(x’) — B(x’) yx) = 0. (13.23) 


Again, the second condition follows on integrating Eq. (13.22) from x = x’ — € and 
x =x’ + €, and then letting € approach zero: 


x'+e d?G(x, x’) x'+e dG(x, x’) x’ +e , 
— 3 + pa) — — dx + g(x)G(x, x’) dx 
x! 
—_—<—<—s__-__’ 


Ixe dx? (/—€ x!—€ 
I II II 
x'+e 
= l ô(x— x") dx. 
x’—€ 
a H 
IV 
dG \x'+ d d 
Integral I = — = — B(x')y2(x) — | —A(x')y1 (x) 
dx |\x'-e dx x'+e dx yie 
d d 
= BG’) y2(x) AG’) yix) 
dx |x'+e dx (= 


= B(x) y +€) — Axy a — €) 
=B) -Aya (e> 0). 
x'+e x'+e 
Integral ZI = [poo | = f de Gdx —0 


dx Jw-e dxe dx 


d 
(since p, G, and 7 are continuous at x = x’). 
x 
x'+e 
Integral Z = f q(x)Gdx—0 (since q and G are continuous at x = x’). 
x'—e 
x'+e 
Integral w= | b(x —x')dx =1. 


x'—e 


Integral J is integrated by parts. Thus, the second condition leads to 


B(x')y9(x') Ay). (13.24) 


ee = 
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Solving Eq. (13.23) and Eq. (13.24) gives A(x’) = y2/W and B(x’) = y,/W, where 
W(x’) is the Wronskian as defined in Eq. (2.56): 


W = yi (Ny aN — yo(x') y; aA. 


Hence, the Green’s function is determined as 


, | yo(x')yi(x)/Wx), asx <x’, 
G(x,x)= 
yi(x')y2(x)/W(x'), x’ <x <b. 


Given the solution for G(x, x’), the solution for y(x) that satisfies the boundary con- 
ditions is 


b 
yx) -| G(x, x^) fx’) dx’ 


PAO oA > yxy) 2 nay 
=] Wo) D f(x)dx i Wo) A f(x)dx 


Z f WIFE) er yc) [ PEIGI ay 
TM J Wo) We) 
= c1y1 (x) + cr y2(X) + ypx). 


In the formulation of the second equality for the first integral, the limits for x’ range 
from a < x’ < x, hence G(x, x’) = G2(x, x’), while similarly, for the second integral, 
G(x, x’) = G, since x’ ranges from x < x’ < b. Here cı and cp are the values of the 
integral taken at the limits b and a, respectively. The function y,(x) is determined 
from the integrals evaluated at the limit x: 


E KOO sofo 
yl) = yatx) f PEO a = vi) f ae 


which are now written as indefinite integrals. 


pee ge (13.25) 


This result follows directly from the analysis of second-order ordinary differential 
equations in Section 2.2.5, where yp in Eq. (13.25) is identical to Eq. (2.64). The gen- 
eral solution to the ordinary differential equation can be written as a general solution 
of the homogeneous equation (see Eq. (2.57)) called the complementary function, 


yh (x) = yi(x) + y2(x), 


and a particular solution y,(x) involving the driving term. The Green’s function 
therefore offers another method to find a particular solution. 


The result in Example 13.2.2 easily follows from this general solution. The two 
linearly independent solutions to the homogeneous problem y” — y = 0 are yı (x) = 
sinx and y2(x) = cos x. Note that yı satisfies the boundary condition at x = 0 and y2 


OOOO. 
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satisfies the boundary condition at 2/2. The general solution to the nonhomogeneous 
problem is 


NODS] gy Faa f * yO IFO) | 


yoo) = ata) f AOE a 


where 
WAN = y NA) — 2&1 @) 
= sin x'(— sin x’) — cos x’ (cos x’) = — (sin? x’ + cos? x') =-—1, 


Thus, the same result is obtained as given for the linear oscillator in Exam- 
ple 13.2.2: 


m/2 


x 
y(x) = -cosx f sinx’ f(x)dx’ — sinx f cosx’ f(x!) dx’. 
0 


x 


Problems 


13.1 Solve the following integral equation using a Laplace transform method: 


t 
y(t) = sint +f y(t) sin(t — tT) dt. 
0 


13.2 Consider the Sturm—Liouville problem in Example 13.2.1. This problem can 
be recast as the second-order ordinary differential equation y’(x) = f(x) with 
boundary conditions y(0) = y(1) = 0 on letting A = —1, p(x) = 1, and f(x) = y(x) 
in Eq. (13.10). Thus, the solution follows from Eq. (13.13) as 


1 
w= Gas FO ds. 


Here the Green’s function is symmetric (that is, G(x, s) = G(s, x)) and is defined as 
the negative quantity of Eq. (13.12) (which satisfies the boundary conditions): 


s(x— 1), O<s<x, 
x(s— 1), x<s<l. 


cas =]| 


(a) Show that for f (x) = x?, the solution using the Green’s function formalism is 
ya) =< (3-1), 

(b) cae that 
dG(x, s) =Í x-1, 0<s<x, 


ðs X; x<s <l, 


ao OOE O 
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so that (9G (x, s)/ds) =x — 1+ U (s — x), where U (s — x) is the unitary step function 
(or Heaviside function) (see Section 3.1.3): 


0 if s<x, 
ue-x)={ 1 if s>x. 
(c) Given that the derivative of the Heaviside function is the Dirac delta function, 
a°G(x, s) 


ðs? 
X+€ 


of the Dirac delta function f ô(s — x) f (s)ds = f (x), show that one obtains the 


show that from part (b) 


= ô(s — x) = ô(x — s). Hence, using the property 


Xe By 2G 
original ordinary differential equation —5 = / —~ (x, s) f(s)ds = f(x). Thus, 
dx? ax? 
1 
yx) = G(x, s) f(s)ds is a solution of the nonhomogeneous boundary value 


problem. Moreover, the Green’s function can be thought of as a response function 
to a unit impulse at s = x. 


13.3 Consider Example 5.2.11 for heat conduction in a semiinfinite solid but where 
the temperature is initially zero in the solid, and at t > 0 a constant temperature of uo 
is applied and maintained at the face x = 0. The initial and boundary conditions for 
this problem become u(x, 0) = 0, u(0, t) = uo, and |u(x,t)| < M. 

(a) Using a Laplace transform method with “#{u(x, t)} = U(x, s) and the trans- 
formed boundary condition U (0, s) = #{u(0, t)} = uo /s, show that the transformed 
solution is U(x, s) = (= 
lem 10.3, the final solution is u(x, t) = LU (x, S)} = üo erfc[x/(2Vkt)]. 

(b) If the boundary condition u(0, t) = uo is replaced by the more general condi- 
tion u(0, t) = g(t), show that the transformed solution is U(x, s) = G(s) eV STR x 


(c) From Problem 10.3, Lo 'NeVs/kxy = a ee Hence, show 


TK 
that using the convolution theorem (see Section 3.1.4) for the inverse transform of 


part (b), the solution for u(x, t) is given by the integral equation 


e~v°/kx_ Thus, using the inverse transform from Prob- 


t 
x —3/2 ,—x?/(4«u) 
a —— t—u)du. 
u(x,t) [ 5 =u e g(t—u)du 


13.4 Prove that Eq. (13.18) is a general solution of the Helmholtz equation in 
Eq. (13.16). 


13.5 Classify each of the following integral equations as a Fredholm- or Volterra- 
type integral equation, as linear or nonlinear, and as homogeneous or nonhomoge- 
neous, and identify the parameter A and the kernel K (x, y): 


(a) u(x) =x+ fy xyu(y) dy, 
b) u(x) =1 4x24 fo — y)u(y) dy, 


a 
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(©) u(x) =e" + fy yu y)dy, 
(d) w(x) = fy —yPu()dy, 
Goei a Sg 
u(x) = 
4°° (x+y) u(y) 
13.6 Convert the following Volterra integral equation into an initial value problem: 
u(x) =x + fo O — x) u(y) dy. 


13.7 Convert the following initial value problem into a Volterra integral equation: 
u” (x) + u(x) = cos x, subject to the initial conditions u(0) = 0 and u’ (0) = 1. 


y. 


13.8 Convert the following boundary value problem into a Fredholm integral equa- 
tion: y"(x) + y(x) = x, for x € (0, 1), subject to the boundary conditions y(0) = 1 
and y(1) = 0. 


13.9 Given the Fredholm integral equation u(x) = x + à if (xy) u(y) dy, solve this 
equation using (i) the successive approximation method, (ii) a Neumann series, and 
(iii) the resolvent method. 


13.10 Given the Volterra integral equation u(x) = 1+ h (x — y)u(y) dy, solve this 
equation using the successive approximation method. 


CHAPTER 


Calculus of variations 


Variational calculus employs an integral representation to find an optimum quantity 
to be minimized (or maximized). It complements ordinary differential calculus as 
presented in previous chapters. It has applications for problems in physics to find 
the motion of macroscopic objects (classical mechanics) or to determine an optimal 
path. It can also be used in engineering for assessment of an optimal shape of an 
object (for example, an aircraft wing in order to minimize wind resistance), or to 
find a “utility function” in economics for measurement of the usefulness of specific 
consumer goods. 

As mentioned, this technique can be used to find the shortest route or distance on a 
surface (a geodesic). In this process, one searches for trajectories that minimize a so- 
called “action,” which corresponds to classic equations of motion for a given system. 
Specifically, one minimizes the value of an integral using a “principle of stationary 
action.” The action is an integral over time S = Je Ldt of a “Lagrangian” L. Thus, 
the action S represents an integral of the Lagrangian for an input evaluation over time 
for the development of a system from an initial time to a final time: 


t 
S= | Ugt).4.nat. (14.1) 
ti 


Here q and q are generalized coordinates and the dot pertains to a derivative with 
respect to the independent variable time t. The endpoints of the evolution are fixed 
as qi = q (t1) and q2 = q (t2). According to “Hamilton’s principle,” the true evolution 
of a system is one where the action is stationary (that is, 6S = 0, thereby involving a 
minimum, maximum, or saddle point of S). This methodology has been historically 
termed “Lagrangian mechanics” and results in the equations of motion. 


14.1 Euler-Lagrange equation 

A fundamental differential equation that is derived from the calculus of variations is 
the Euler-Lagrange equation. This equation is derived from Eq. (14.1) by applying a 
stationary action: 


2 2/aL ƏL 
s= | Lig.d.ntar = f (Z4 aa) dt (14.2a) 
ti ti ðq ðq 
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2 (ƏL ƏL o) 
a ôq + = dt =0. (14.2b) 
I (3; 1. aq di 
Integrating the last term of Eq. (14.2b) by parts on letting 
OL 
= and dv=d(6q), 
q 
d (aL r ; P (14.3) 
= — | — an = 
“= u \ag a 
yields 
2 (aL dis 2 OL ƏL |?) 2 (d aL 
f (5 ca) a= f of aeay= | 84 f ( cai) 4 
n \3q dt n Og ðq ü n \dt ðq 
(14.4) 


Inserting Eq. (14.4) into Eq. (14.2b) gives 


h to 
2 2f/dL doaLt 

+f (= = aa) dqdt = 0. (14.5) 
ä n \0q dtôõå 


Since only the path is varying and not the endpoints, we have ôq (t1) = ôq (t2) = O 
and Eq. (14.5) becomes 


ss= [" (2 Taq) bat =0 (14.6) 
~ J, qg atag) TTS ' 


aL 
5S = — ôq 
ðq 


Hence, for an arbitrary small change ôq for a stationary action the integrand itself 
must equal zero, so that 


— -—-—— =0, (14.7) 
ðq dt oq 


which is the Euler-Lagrange equation. 


Example 14.1.1 (Second law of motion). Use the Euler-Lagrange equation to derive 
Newton’s second law of motion. 


Solution. 

Let the generalized coordinate be position for the dependent variable so that q(t) > 
x(t) andq (t) > X(t), where i = dx /dt is the velocity. The Euler-Lagrange equation 
in Eq. (14.7) can therefore be written as 


doL ƏL 

dt 0x ðx 
In this case, the Lagrangian can be defined as the difference between the kinetic 
energy T and potential energy V of a system: 


=0. (14.8) 


L=T-V. (14.9) 
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The kinetic energy of a particle of mass m is 


1 
T = —mx?. 


(14.10) 


The negative gradient of the potential energy V (x) is defined as the force F: 


F@= a, (14.11) 


Thus, substituting Eq. (14.9) into Eq. (14.8) and using Eq. (14.10) and Eq. (14.11) 
for the kinetic and potential energies, respectively, yields 


d aV) 


P7 (mx) dx mx — F(x) =0> F=ma, (14.12) 


which is Newton’s second of law of motion, where a is the acceleration such that 
a=Xx. 


Example 14.1.2 (Straight line between two points). Using the Euler-Lagrange equa- 
tion, show that a straight line is the shortest distance between two points on a flat xy 


plane. 
yî 
Distance 
element 
dx 
a 
FIGURE 14.1 
Schematic of a distance element in flat space. 
Solution. 
Using the Pythagorean theorem (see Fig. 14.1), the small distance element on a flat 
surface is 
ds =[(dx)* + (dy)*]'? = [1 + y2]! dx. (14.13) 


Hence, the distance may be written as 


X2,y2 X2,y2 X2 
s= as= f (idx)? + (dy? = f [1+ y2]'/dx. (14.14) 
w * 


1,Y1 15Y1 xı 


ER 
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Letting the generalized coordinates in Eq. (14.1) and Eq. (14.7) become q > y, g > 


ii, and t > x, 


dx 
x2 
s=[ Ly, yx, x]dx (14.15) 
X1 
and 
OL d ƏL 
pleas = (14.16) 
dy dx Oyy 


Comparing Eq. (14.14) with Eq. (14.15), the Lagrangian L for this problem is 
L=[1 + yy”. (14.17) 


Therefore, substituting Eq. (14.17) into Eq. (14.16) yields 


d Yx = Yx a 
Tx [ml> gme (a constant). (14.18) 
The solution of Eq. (14.18) is 
yx=a (aconstant) > y =ax +b. (14.19) 


The second relation in Eq. (14.19) is the familiar equation of a straight line where 
a and b are constants so that the line passes through (x1, y1) and (x2, y2). Thus, the 
Euler-Lagrange equation predicts that the shortest distance between two fixed points 
is a straight line on a flat plane. [answer] 


This result is further generalized in Example 14.1.3, where the shortest distance in 
four-dimensional spacetime is a geodesic, which is a basic construct of general rela- 
tivity. 


Example 14.1.3 (Geodesic equation). Derive the shortest distance on a curved sur- 
face using the Euler-Lagrange equation. 


Solution. 

The methodology in Example 14.1.2 can be generalized using a metric g,, for 
four-dimensional spacetime to account for any type of surface and for any general 
coordinate system: 


ds? = Sy ydxydry. (14.20) 


For instance, to represent Eq. (14.13) with two dimensions (i.e., with indices u = 1, 2 
and v = 1, 2), Eq. (14.20) can be written in the following component form using the 
Einstein convention of summing on repeated indices (Lieber and Lieber, 1966): 


ds? = gidx,- dx, + gindx,-dx2 (14.21) 
+ g21dx2- dx, + go2dx2-dx2, ` 
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where g, considering the previous example of a flat (Euclidean) surface, is the 2 x 2 
matrix |81 812| = 19 ; 
821 82 0 1 
Following the methodology of Example 14.1.2, Eq. (14.14) can be similarly writ- 
ten using the generalized Lagrangian function 


L = // gapiuãg, (14.22a) 


where the action is 


à 
s=f L(xaæ, ža, A) dX. (14.22b) 
à 


0 


dXa(àÀ 
Here the path is parametrized by A, in which xa (à) = a (see Fig. 14.2). 


N 
p, A 
am Ag J 
"i fs / 
_— — Y ds A 
Ae = ae 
me ae ae 
FIGURE 14.2 
Various paths between two fixed points A and B. 
Thus, in summary, the action for this problem is 
à à 
s=[ ds =) Sapraxp dà. (14.23) 
ho ho 


The Euler-Lagrange equation with generalized coordinates in Eq. (14.7) can be writ- 
ten to minimize the path length as 
OL d ðL 
—— — —— =l. (14.24) 
OX, dà Xu 
As follows from the derivation in appendix B using the Lagrangian formulation, 
the equation of a geodesic is 


x) + žais, = 0, (14.25) 


0 os 
where the Christoffel symbol T's, = Fg” [85 Sue + v8 — 91805] and Ogi; = = ; 


For instance, for ordinary three-dimensional (Euclidean) flat space where v = 1, 2,3 
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with coordinates x1, x2, and x3 and choosing Cartesian coordinates (that is, g1; = 
822 = 833 = land guv =O for u Æ v), V's, = 0 since the derivatives of these constant 
quantities of the metric in Eq. (B.9) are zero. Thus, Eq. (14.25) reduces to x, = 
d?xy . d?xi d?x d?x3 . 

= 0, or in component form, =0, = 0, = 0. The solution of 
di aie ” dX di? di 
each component equation is a straight line for the geodesic, analogous to Eq. (14.19) 
in Example 14.1.2. In particular, the solution of these equations (that is, for x = 
X1, Y = X2, Z = x3) yields: x = x9 + Aa, y = yo + àb, z = zo + Ac where all other 
quantities are integration constants. Thus, solving for A yields the Cartesian equation 
x—X0  Y—yO 2-20 

b c 


of a line: 


Example 14.1.4 (Equivalent form of the Euler-Lagrange equation). The Euler— 
Lagrange equation in Eq. (14.8) can be written as 


af dəf _ 


dy dx dy! 


where f is a function of f(x, y, y’). Show that this formulation is equivalent to 


d , OF Of _ 
nl rd = 


This latter form is useful if f is not explicitly a function of x, as shown in Prob- 
lem 14.1 and Example 14.1.6. 


Solution: 
Using the chain rule, 


df ofdx  odfdy | dof dy’ 

dx axdx  dydx dy’ dx 
_ of of , Əf p 
-k aye + ay? 


(14.26) 


In order to eliminate the last term in Eq. (14.26), one can use the chain rule on the 
following relation: 


d (of ,d ( of af n 
= —y", 14.27 
dx (> =) ? ax (32 dy’ ( ) 
Thus, subtracting Eq. (14.27) from Eq. (14.26) yields 
d of of 3 d of 
f=y syl =}. (14.28) 
dx dy Ox dy dx dy 


=0 
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Since the term on the right-hand side of Eq. (14.28) in curly brackets is the Euler- 
Lagrange equation that equals zero, the result follows: 


d , OF Of 
a |; vi =0. (14.29) 


Example 14.1.5. The solution of the brachistochrone problem is an important histor- 
ical result in the 17th century solved by Bernoulli for finding the optimal path from 
which an object will fall from one point to another (see Table 1.1). Consider a bead 
of mass m that is able to slide down a wire by gravity without friction to the point 
P2 in Fig. 14.3. Find the shape of the wire so that the bead falls from the origin 0 to 
point P2 in the least amount of time. 


aks : 
QW SS öl i \ 
| SeP(sy) | ay} NE 
| NA ley 
|y Palsy) ~< 


| 


FIGURE 14.3 


Schematic of a bead sliding down a wire without friction for a brachistochrone problem. 


Solution. 
The equivalent form of the Euler-Lagrange equation in Eq. (14.29) can be used to 
solve this problem. Using a conservation of kinetic and potential energy at the point 


ds\? d 
0 and P2, 0 + mgyi = 5m (=) +mg(yi-y)=> = = ./2gy. Using this result, a 


dt 
T 1 x2 /] +y? 


separation of variables yields the total time T: T = f dt = 


0 V2eto VV 


dy\ 1/2 
Here (from the insert picture), the relation ds = (: + z) dx has been used to 
x 


convert from ds to dx in the second integral. The function f = y1 + y’*/,/y does 
not depend on x, so that Eq. (14.29) reduces to f — Y = c, Where c is a con- 
y 
stant. Thus, inserting f into the reduced Euler-Lagrange equation and solving for 
dy a-y 


y’ yields y’ = ae , where ./a = 1/c. This relation can be separated and 
x 


y 
integrated on both sides, yielding / dx = I | 2 dy. Using a change of vari- 
a—y 


ables for y, where y = asin6, this integral can be evaluated as x = 2a f sin? 6 d8 = 


S 
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O 27b 4zb_ X 
A g ) 
| aw Se = 
| 
R4 
FIGURE 14.4 
Curve represented by a cycloid with a fixed point A on a circle of radius b as it rolls along 
the x axis. 
interface 
light ray 
normal 
FIGURE 14.5 


Derivation of Snell’s law. 


a fo — cos 20) d0 = (20 — sin 20) + k. The integration constant k = 0 since the 


curve passes through the origin. Letting b = a/2 and ¢ = 20, the curve in Fig. 14.4 is 
given by x = b(ġ — sing) and y = b(1 — cos ġ), which represents a cycloid. [an- 
swer] 


Example 14.1.6 (Snell’s law in optics). Consider the propagation of light through a 
medium. The index of refraction is defined as n = c/v, where c is the speed of light in 
a vacuum and v is the phase velocity of light in the medium (e.g., n = 1.3 for water) 
(see Fig. 14.5). The refractive index determines how light is bent from the normal to 
the surface when entering a material. Derive Snell’s law using a variational technique 


x2 d 
since the light in a medium takes a minimum travel time so that 6S = f as 0. 
x, V 
Solution. 
x2 /1 12 
From Example 14.1.2, ds = 1+ (y’2)dx, so that 6S = f — dx = 
X1 v y 


x2 
f f(y, y) dx. Since f(y, y’) does not depend on x, the equivalent form of the 
xı 


d 
Euler-Lagrange equation in Eq. (14.29) can be used, yielding T (f —y' fy) =0. On 


vity? oy? O 


v vJ/I+y2) vf/l+y2 — 
constant. Letting 0 in Fig. 14.5 be the angle between the tangent to the surface and 


integrating, f — y’ fy = constant so that 


S 
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d 1 1 
the light path, sind = — = ————. Therefore, as follows, — sin = constant. 
8 P ds / 1 + yl? Vv 


However, from the definition of the refractive index, n « 1/v, so that Snell’s law 
follows: 


n sin@ = constant. [answer] 


Generalizations of the Euler-Lagrange equation 


Multidimensional generalizations of the Euler-Lagrange equation for determining an 
extremum with several functions and/or variables are given as follows: 


(i) For several functions (fi, f2,..., fm) of a single variable x and derivative, the 
extremum of the functional 


X1 df; 
Is Paros Sd = ff LAX, fis faa Jas Fos Flo on fds fl = 2f 


i dx 


satisfies the Euler-Lagrange equations 


ƏL d (ƏL\_ 0 
af, dx Ə) ` 
(ii) For a single function with n variables x with a single derivative for a surface Q, 
the extremum of the functional 


O) 
n= f L {x1, X2, Xn f, fis fas -> Jfa} dX; fj = st 
Q xj 


satisfies the Euler-Lagrange equation 


aL 5 a (aL 
( ) E k 
Of F xj NIS) 


(iii) For several functions with several variables with a single derivative, the ex- 
tremum of the functional 


Ifi, fas fm] 


-f L fxi, e.. Ñn, fi e.. Ím, fii soby Fins viy Ím., *¥e3 fmn} dx, 
Q 


afi 
Ox; 


fij = 


satisfies the system of Euler-Lagrange equations 


ƏL Æ 3 Of OL 
D =01, 
Of, ay 887 NAA 


M 
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ab Å 8 /ƏL 
Dal) 
afr ya Od Ofe, j 


n 


aL ð T ) 
= On. 
Əfm =] Ox dfn, J 


J 


For example, from case (ii), if the function f = f(t, X,Y,Z, U, Ut, Ux, Uy, Uz), 
where the dependent variable u = u(t, x, y, z) and the independent variables are t, x, 
y, z, the Euler-Lagrange equation can be generalized as 


3f 3 ð ð 
P ae 5 (Fn) + 5 (n) + E (fa). (14.30) 


Example 14.1.7 (Vibrating string equation). Derive the equation of motion from the 
Euler-Lagrange equation for the vibration of an elastic string of length £ in Fig. 5.2 
with a displacement u(x, t). 


Solution. 
Let an element of an unstretched string at equilibrium be dx and let the corresponding 
length for the stretched string be ds. Hence, using Eq. (14.13) along with a bino- 
mial expansion, the lengthening of the string is ds — dx = /dx? + du? — dx ~ 
1 (du? 1 (au\? . PEN 
1+-={—J] +..|dx— dx ~ -| — | dx. The potential energy density is re- 
2 \ðx 2 \ðx 


dV 
lated to the restoring force: F = OTEN Given that the restoring force is the 
s— dx 


1 (au? 
string tension T, dV = T (ds — dx) = 5 (+) , on integrating, the potential energy 


l 
is V = re f u?dx. The kinetic energy density of the string with a constant mass 
0 


ee ae OB Giant Gk 2 T au)? g 
er unit ien 1S = = — x n integrating, = > — x. 
p gth p ater grating Pa 


In accordance with Hamilton’s principle over the interval from t; to t2, the path is 
minimized such that 


t2 
ti 


Thus, in this example, 
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t2 
s= | (K — V)dt 


ti 
nh ply 14.31 
= f Í - (pu? — Tu?) dx dt een 
ti 0 2 : 
=0, 


to £ 1 
which has the form af / L(u;,ux) dx dt, where the Lagrangian L = 5 (pu; — 
ti 0 


Tu?). Hence, as follows from Eq. (14.30), 


ƏL əð/ƏL ða (ƏL 
=0. (14.32) 
ðu ðt \ Ou; Ox \ðux 


Using the Lagrangian L, Eq. (14.32) yields 


ð 0 
— =T — x 
pa (ur) Ae (ux) 


or 
ə?u 2 Ə?u 
ar2 i 3x?” 
where c = yT /p is the speed of the string wave. This equation is the one-dimensional 
wave equation for a vibrating string in Eq. (5.10). 


[answer] 


14.2 Lagrange multipliers 


Lagrange multipliers can be used for minimization problems when there is a con- 
straint. For instance, given a function with three independent variables f(x, y, z), the 
minimization (or extremum) of the function requires df = 0. Given the constraint 


p(x, y,z) =0, (14.33) 


the function f(x, y, z) can be minimized with the use of a Lagrange multiplier such 
that (Arfken et al., 2013) 


ð ð 
`f +£ =0, 
X X 
(14.34) 
ð 
U pg Ea 
dy ay 


The Lagrange multiplier can also be incorporated into the calculus of variations. 
By Hamilton’s principle, one seeks a path that is stationary: 


f [ra di, t) + 5 AKL) RG, J dt =0. (14.35) 


k 
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The Lagrangian equations of motion now have an added constraint that includes the 
Lagrangian multiplier Àg: 


doL ƏL 
et LN ae 14.36 
doa 8a 2an k ( ) 
where 
0 
dp Ak (14.37) 
Ogi 


In particular, if q; is a length, then a;,A,% represents the force of the kth constraint in 
the g; direction analogous to Eq. (14.11). 


Example 14.2.1 (Minimum critical volume for a cylindrical reactor). Find the mini- 
mum critical volume for a finite cylindrical nuclear reactor of height H and radius R. 
There is a constraint from neutron diffusion theory that for the geometric buckling of 
the reactor By 


2.4048 \? 2 
o(R, H) = (=) + (=) = B? = constant. (14.38) 


Solution: 
Using Lagrange multipliers in Eq. (14.34), the function f for the volume of a cylinder 
is (Spiegel, 1973) 


f(R,H)=V =x7R°H. (14.39) 
Hence Eq. (14.34) yields 
3 2.4048)? 
OF oa Coann O Sy 
ƏR ƏR 3 
(14.40) 

ôf 5 OP aR- =, 
3H “3H ” 35 


Multiplying the first equation by R/2 and the second one by H, 


(2.4048) „27? 


rR H=X - =k (14.41) 
yields 
2x R 
H = 2.4048 = 1.847R. [answer] (14.42) 


Moreover, using Eq. (14.38), Eq. (14.39), and Eq. (14.42), the critical volume in terms 
of the buckling B, is V = 


3 
E 


. These results are given in Table 2.14 of Lewis et 


al. (2017). 
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Problems 


mv? 


14.1 Consider a bead on a rotating wheel, as shown in Fig. 14.6. Here F = —— 
r 


2 
F =—— bead rotating wheel 


FIGURE 14.6 


Schematic of a bead on a rotating wheel. 


is the centrifugal force, m is the mass of the bead, v is the tangential velocity at 
the radius r of the wheel, and n is the number of rotations per second such that 
w=2rn. 


1 
(a) Show that the kinetic energy in the problem is T = 5m. Given that v = wr 


mær? 


2 
(b) Using the Euler-Lagrange equation, show that the equation of motion is 
F =&@r, which has the solution r (t) = Ae~®! + Be”. 


dV 
and using Eq. (14.11) such that F = TI show that V = — 
r 


14.2 Consider a pendulum as shown in Fig. 14.7 of mass m and length /. The angle 
of the pendulum from the vertical is @ with a velocity of v =1dq¢/dt. Using the 


oh 


FIGURE 14.7 


Schematic of the pendulum for an Euler-Lagrange analysis. 
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Euler-Lagrange equation in Eq. (14.8), with the Lagrangian L defined by Eq. (14.9) 
as the difference between the kinetic energy (T = sm v?) and the potential energy 
(V =m gh), show that the equation of motion for the pendulum is given by 


do — gsing = 


— 0. 
dt? I 


Hint: The state variable for this problem is the angle @ that the pendulum makes from 
the vertical. 

14.3 Consider a small block of mass m sliding a distance s down a moving wedge 
of mass M, as shown in Fig. 14.8. The wedge is also sliding a distance x along the 
floor. Neglecting frictional effects, calculate the following: 


y 


s 
Cae X 


FIGURE 14.8 


Schematic of a sliding block on a moving wedge. 


(a) Given that the velocity of the wedge is simply x, show that the x- and y- 
components of the velocity of the small block are vy = x — Scos@ and vy = —s sin 0, 
respectively. 


1 
(b) Show that the total kinetic energy of the combined system is T = 5 Mi + 


1 
zma? + 5? — 2x5 cos 0). Also show that the potential energy for the small block is 
V = —m g s sin, where g is the acceleration due to gravity. 


(c) Determine the Euler-Lagrange equations for this problem for the two variables 
s and x and their derivatives. 


(d) Solving the Euler-Lagrange equations in part (c), show that the two accelera- 
tions § and ¥ are given by 


(M +m) . 5 M ; 
§ = ——— 8sinf =c] and x = ——— 8 sinf cos? = c2, 
(M + msin~ 0) (M + msin4 0) 


where cı and c2 are simply constants. 
14.4 Show that the shortest curve that has an area A below it is a circular arc, 
(Ax —c)?+(ay—d)y*=1, 


as shown in Fig. 14.9. Here à is a Lagrange multiplier constant and c and d are 
constants of integration. For this constrained problem, y(0) = a, y(1) = b, and 


A= fy y(x)dx. 
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Optimal curve 
`“ 


“1/À 


1/2 


Area below arc is A 


FIGURE 14.9 


Schematic of a circular arc for the optimal shape of a curve with an area under it. 


Hint: Use the Euler-Lagrange equation in Eq. (14.36) with the constraint g = A. 
From Example 14.1.2, the Lagrangian under consideration is given by Eq. (14.17). 


APPENDIX 


Maple software package 


This appendix provides a list of key commands for the commercial software package 
Maple (Maplesoft, 2015). Maple is an analytical and numerical solver tool that can 
be used for general mathematical analysis. This tool can be used in the book for rapid 
calculation of various mathematical constructs. 

It reviews basic Maple commands and reserved symbols, as well lists some com- 
mon functions that arise in engineering. Various operations are described involving 
differentiation, integration, and the solution of a single ordinary differential equa- 
tion or systems of differential equations. Problems in linear algebra are also briefly 
considered involving basic matrix operations such as addition, subtraction, and mul- 
tiplication of matrices. Also provided are techniques for evaluating the transpose, 
determinant, and inverse of a matrix, as well as the solution of a linear system. 

For the syntax of the Maple commands listed below, a “Text” input button is cho- 
sen (this input style also encompasses earlier versions of the software). The “Text” 
button option is found in the legend showing separate buttons for: “Text,” “Math,” 
“Drawing,” “Plot,” and “Animation.” It is located just above the worksheet and for- 
matting legend on the graphical user interface. 


A.1 Maple commands 


Arithmetic operations 


Operation Symbol | Example 
Addition + 2+2 
Subtraction — 10—x 
Multiplication | x Z* YZ 
Division / KZ 
Exponent **ora XAZ 
Factorial ! 10! 


Reserved symbols: 1 — Pi V-l1—> I 
e— exp(1) oo — infinity 
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Functions 

exp(x), In(x), log10(x), sqrt(x), abs(x) 

sin(x), cos(x), tan(x), cot(x), sec(x), csc(x) 

arcsin(x), arccos(x), arctan(x), arccot(x),arcsec(x), arccsc(x) 

sinh(x), cosh(x), tanh(x), coth(x), sech(x), csch(x) 

arcsinh(x), arccosh(x), arctanh(x), arccoth(x), arcsech(x), arccsch(x) 
GAMMA(x), BesselJ(v, x), Bessel Y(v, x), Bessell(v, x), BesselK(v, x) 


Note: > is the Maple prompt after which is what you type: 
> expr := (x A 2) + (y A 3); (one hits enter after the semicolon) 


Differentiation (first way) 


diff(expression, variables) 


> diff(expr1, x); 
> diff(expr1, x, y); 
> diff(expr1, x, x); 


Differentiation (second way) 

> f :=x— > (x ^A 5) + (x A 4); (— is the minus symbol followed by an inequality 
symbol) 

> DOQ) Ge. f"@)); 

> (D@ @3)(f)(x) (ie. f(x); 


Integration 

> expr! := exp(a * x) * cos(b * x); 

> int(expr1, x); 

> diff(%, x); (% refers to the last result) 


Summation of series 
> sum(f,i=k..n); 


Commands 

> simplifyQ; (simplifies an expression) 

> evalf(); (numerically evaluates functions and parameters) 
> expand(); (expand an expression) 

> factor(); (factor an expression) 


Differential equations 


dsolve(equation, dependent variable(independent variable)) 


For the differential equation: (x? y’ = y? + 5xy + 4x?) this can be written as 


Se ___——— 
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> eql := (x A 2) x diff(y(x), x) = (yA 2) + SG «xx y) +4 (KA 2); 
> dsolve(eql, y(x)); 


> eq2 := (D@ @4)(y)(x) +4 * y(x) = 0; 
> dsolve(eq2, y(x)); 


For an initial value problem (note: it is important to use only integers with no floating 
points! (>? topic;)): 

> eq3 := (D@ @3)(y)(x) — D(y)(x) = 0; 

> ({eq3, y(O) = 6, Diy) (0) = —4, (D@ @2)(y) (0) = 2}, y(x)); 


Systems of differential equations 

> sys {D(y) (x) = 4 x y2(x), D(y2) (x) = 4 x yl (x) +2—16*xA 2}; 
> fens := {yl (x), y2(x)}; 

> dsolve(sys, fens); 


> sys{D(y1)(x) = y1 (x) + 2 * y2(x), D(y2)(x) = —8 * y1 (x) + 11 * y2(x), 
y1(0) = 1, y2(0) = 7}; 

> fens := {y1(x), y2(x)}; 

> dsolve(sys, fens); 


Linear algebra 


One must load the “linalg” package first 


>with(linalg): 
—-1 —2 4 
Given the matrix: A=] 2 —5 2 
3 —4 6 
> A:=matrix(3,3,[-1, —2, 4, 
2, —5, 25 
3, —4, —6]); 
The returned result is 
—-1 —2 4 
A=| 2 —5 2 
3 —4 —6 


Addition and subtraction of matrices 
>5xA—3x«B; 


Multiplication of matrices 
> A.B; 
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Transpose of matrix 
> transpose(A); 


Inverse of matrix (see Example 4.2.1 and Example 4.3.4) 
> inverse(A); 


-19 14 -8 
23 23 23 
=9 3 =) 
23. 23 23 
-7 5 -9 
46 23 46 
Determinant of matrix 
> det(A); 
—46 
Solution of a linear system Ax = b 
For example (see Example 4.3.1): 
-1 —2 4 x] =3 
2 -5 2 “|=| 7 
3 —4 —-6]| | x3 5 
> b := vector([—3, 7, 5]); 
b:= [-3, 7, 5] 
> x := linsolve(A, b); 
x:= [5,1,1] 


An alternative method to solve the system of equations is with a Gauss elimination 
technique (see Example 4.3.3): 
> Al := augment (A, b); (augmented matrix of the system Ax = b) 


af -2 4 3 
Al=|2 -5 2 7 
3-4 -6 5 


> bl := gausselim(A1); (A1 triangularized by row operation) 
-1 —2 4 —3 
bl = 0 —9 10 1 
—46 —46 
Oe ee e 


oo (CCCCCCCOCLU 
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> xl := backsub(b1); 
xl:=[5,1, 1] 


Solution of a linear system in Example 6.2.2: 


—4 1 1 0 ull 1 
1 -4 0 1 uzn | _ | —3 
2 0 —4 1 U12 = —1 
0 2 1 —4] | u22 —8 


> A:=Matrix([[ —4, 1, 1 0], 

[1, —4, 0 1], 

[2, 0, —4 1], 

[0, 2, 1 —4]); 
Note that a slightly different format is employed for inputting the matrix (where the 
size of the matrix is not included). Here, the matrix is input using the automated 
“Insert Matrix” command in Maple. The returned result is 


—4 1 1 0 

1 -4 0 1 

a 2 0 -4 1 
0 2 1 —-4 


> b:= vector([1, —3, —1, —8]); 
b:= [1, -3, =A; —8] 


> u := linsolve(A, b); 
pe 267 206 507) 
161° 161° 161° 161 


u:= 
> evalf(%); 
[0.4844720497 1.658385093 1.279503106 3.149068323] 


Fourier series analysis (see Example 2.5.15) 
> Cn i= 1/(2 x Pi) x int(exp(—Pi * x) x exp(—I * n * x), x = —Pi..Pi); 


1 (eras _ 1) e77 -Inn 


Cn = 


2 a(x +In) 


> simplify(%); 
1 et (a+In) _ et (atin) 


2 u(x +In) 


APPENDIX 


Geodesic formulation 


The following appendices provide the supporting analysis for derivation of the 
geodesic, in terms of a brief definition of tensors in Section B.1 and the derivation of 
the geodesic equation using the formalism of the Lagrangian in Section B.2. 


B.1 Tensors 


The metric g,» is in fact a covariant tensor of rank two (since there are two indices). 
A scalar is a tensor of rank zero (having only a magnitude), while a vector is a ten- 
sor of rank one (having both a magnitude and direction) (see Section 4.4.1). For a 
change in coordinate system from an unprimed to a primed one, the corresponding 
transformation laws for a covariant tensor (lower indices) and contravariant tensor 
(upper indices) of rank two are given as (Lieber and Lieber, 1966): 


Oxy axs . 
ax), xy 7” 


A 


e covariant tensor: Aag = 


e contravariant tensor: A’ %8 = 


ƏXu OXy 
One can also multiply a covariant tensor by a contravariant one to yield a mixed tensor 
: , Ixa OX), 
with the transformation law C = E a £ ; 
Ox, Oxg 


The physical difference for a covariant and contravariant tensor is depicted in 
Fig. B.1. Here a vector V is represented in a nonorthogonal plane Cartesian sys- 
tem of coordinates in the Euclidean plane, with (normed) basis vectors â and &. As 
seen in the figure, the covariant components (x1, x2) of V are the perpendicular pro- 
jections (dot product of each of the basis vectors) on the coordinate axis, in contrast 
to the contravariant components (x!, x?) of V, which are parallel projections. 
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(x,,X,) Covariant components 


(x?,x?) Contravariant components 
FIGURE B.1 
Geometric representation for covariant and contravariant components of a vector. 


B.2 Lagrangian and action 


It is easier to deal with L? than L in Eq. (14.24) because of the square-root sign in 
Eq. (14.23) (d‘Inverno, 1992). Multiplying Eq. (14.24) through by 2L gives 


aL) 4, d ak 


=0. B.1 
OXp dh dx oo 
l ue a(L3] .. 
Using the chain rule successive times for the expression — - gives 
dà | xu 
d [a(L? d aL dL ðL d [ aL 
( ; ) = 2L—— |=2 —+2L - (B.2a) 
dà | əxa dix OX dà ðxu dà Loxp 
d [ aL d [ a(L? dL ðL 
a oe ee oe a, (B.2b) 
dh | aX, di | dx, di dX, 
Substituting the expression in Eq. (B.2b) into Eq. (B.1) yields 
a(L?) d [a(L? dL dL 
(°) ( - ) = —. (B.3) 
OX dà | dXp dà ðxu 
— — 
(1) aD ID 


Inserting L from Eq. (14.22a) into each of the three terms in Eq. (B.3) gives 


So eee 
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ð a es 
Ud) = gy Sapxoxpl = [dn SapXaxzl, 
Xu 


d 9 >.. d. dir. dig. 
II — = = ———= ees 
D dh Ox [apraxz] dh [Sap dtp XB + Sap dtp Xa] 
we we 
Sou Bu 
d ; . d » , d8ua . 
= q leup + Suara = g 28uere] = 28naXa +2 aL Xa 
Wade (B.4) 
& ð dxs 
= 28 ua Xa +2 A dh Xa, 
sae 
dsg ua Xs 
dL ə — dL 1 ie 0 oe 
UD = 2 pa, VSA] = 2715 apni? 5 Beteko 
TŘ 
28unXn 


The last term in (ZI) arises from the result of the intermediate expression in (/7) with 
a change of indices from « to 7. Moreover, for the derivation of Eq. (B.4), the Kro- 


necker delta function 6); = | ; F : has been used (also 6;; = 5). A change 
of index variable is also employed in some cases when the repeated index indicates a 


0 T 
dummy variable for summing. In addition, the function 0; g;; = Sij . There is also a 


symmetry that can be exploited for the metric and delta functions such that g;j = gji 
and bij = Ôji. 


From Eq. (14.22b), 


À 
s=[ L(xXa, ža, 4) dd 
ho 

dS) f 


> S) = = = L(xa, ža, à)| = L(xa, ža, à) — L (Xa, Ža, 0). 
dd wo 


.. dL 
Hence, S(A) = T and the third term (//J) in Eq. (B.4) can be written using 
Eq. (14.22b) as 


HID = 2— 8unn: (B.5) 


Since the path length S = A for the parametrization of the curve in Fig. 14.2, 5 =0 
in Eq. (B.5). Thus, the third term (ZI) vanishes in these equations. This result also 
follows by recalling that for the variation of the action 5S = 0 in Eq. (14.6) so that 


OOOO 
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ôS dS 

ah x T = 0. Using the final results of Eq. (B.4) with the term (//7) = 0, Eq. (B.3) 

becomes 
Ou 8opXaXpB = 28 axa = 2953 SwaXaXs =0 (B.6a) 
=> 28paxXa + XaxXsl—In Sas + 2038 pa] = 0 (B.6b) 
=> 28paxXa + XaxXsl—In Sas + 3ga + sgnal = 0. (B.6c) 


The expression in Eq. (B.6b) was obtained by multiplying through by —1, changing 
the dummy subscript of 6 to ô, and collecting terms. Moreover, since œ and ô are 
repeat (dummy) indices, these indices can be interchanged in the last term in square 
brackets in Eq. (B.6c), yielding 


P aa, 1 1 1 
2guaža + 2kaksl— 5 Op 80s + 5 068na + zagu] =0 (B.7a) 
i Ea 1 1 1 
> 28naXa + 2kaksl— 5 Ou8as F z8ua F 7 9a 83u] =0. (B.7b) 
In Eq. (B.7a), a factor of 2 was factored outside of the square brackets. For the last 


term of Eq. (B.7b), the u and ô subscripts were interchanged because of the symmetry 
of the metric. Multiplying Eq. (B.7b) through by the contravariant tensor g”” yields 


Za” guata + Zkaks 581810 + agsu — Iu8as] = 0. (B.8) 
Dia 
The term 
Iie = OTe + dagu — Mas] (B.9) 


is identified as the Christoffel symbol. There is a symmetry in the lower symbols of 
Py. In addition, g" guo = ôw so that Eq. (B.8) reduces to the final equation of a 
geodesic: 


Ñ, + až r}, =0. (B.10) 
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